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A DOCUMENT ASSEMBLY SYSTEM 
FIELD OF THE INVENTION 

The present invention relates to a document assembly system, and processes executed by 
5 and components of the system. 

BACKGROUND 

Document assembly refers to the generation of an instance document from one or more 
source documents. In general, a source document is a generic template document, and 
additional information specific to the relevant circumstances is required to generate an 

10 instance document from one or more source documents. This- additional information can 
originate from a user and/or some other data source. Document assembly software has 
been developed for . generating documents that typically contain large amounts of common 
text or data with a smaller amount of varying detail text or data. Document assembly 
software is useful because, where a suitable source document exists, it enables instance 

15 documents to be produced more efficiently than may be the case using a standard word 
processor. A form letter is perhaps the simplest and most familiar example of a source 
document, and can be used to generate instance letters for a number of recipients. An 
instance letter is typically generated from a single source document and addressee 
information, such as the addressee's first and last names, title, and address. More complex 

20 instance documents, such as legal or financial documents, can be generated from one or 
more source documents, based on information specific to the parties involved and the 
circumstances of their relationships. 

A source document is represented in a document assembly system in some data format. 
25 Common data formats (only some of which are commonly used for source documents for a 
document assembly system) include plain text, Microsoft's proprietary Microsoft Word 
"doc" format, the rich text format (RTF), portable document format (PDF), and hypertext 
markup language (HTML). A data format which is now being used for a wide variety of 
applications, is extensible markup language, or XML, documented at 




http://www.w3.org/XML . An XML document combines the text of a document with tags 
that markup that text into logical elements. As a data format for storing documents 
generally, XML has a number of advantages over other data formats. In particular, XML 
can be used to markup text in a way that tags it with its meaning or purpose, and 
5 applications can manipulate the text on the basis of these tags. Tools for parsing and 
manipulating XML data are available from a variety of vendors. 

XML allows a document grammar to be defined which an XML document must match if it 
is to be said to be valid with respect to that grammar. If a document is valid, then systems 

10 that can handle documents matching that grammar can manipulate those documents taking 
advantage of the grammar. Such a grammar is often contained within a "document type 
definition" (DTD) or "XML schema". There are many different grammars for XML 
documents that are designed to meet specific needs. For example, the DocBook document 
type definition, documented at http://www.oasis-open.org/docbook/xml/, was designed to 

15 meet very general documentation-requirements. * : - 

A document assembly system preferably performs a number of basic functions. First, it 
determines, on the basis of data provided to it, which parts of a source document to include 
in or exclude from a resulting instance document. For example, a paragraph, sentence or 

20 phrase might only be included in a legal contract if there is a guarantor. Second, the system 
can also include in the instance document text which is not present in the source document. 
For example, a date, an address, or where a user of the system enters a yearly rental, the 
amount calculated to be payable per calendar month. In order to be able to provide these 
two basic functions, a document assembly system stores (i) information as to which parts 

25 of the source document may be included or excluded from the instance document, and (ii) 
information as to the locations in the document in which additional text may be inserted. 

It is also desirable to be able to repeat a passage of text a specified number of times, but 
with different data inserted at certain points within the passage in each repetition. This 
30 requires the ability to identify the passage to be repeated, the number of times to repeat it, 
and the data to be inserted into each repetition. 




Existing document assembly products that work with source documents which are not 
XML based often encode the information described above directly in the source document. 
This is possible with XML source documents as well. The information could be stored as 
5 additional elements or attributes in the XML document itself. However, a serious 
difficulty with this approach is that the document will not be valid unless the grammar is 
altered to allow the inclusion of that information. 

In an alternative approach, taught in United States Patent 6,006,242 (Poole, et al. 

10 * "Apparatus- and method for dynamically creating a document"), entity references are 
embedded in a document instance, and a dedicated entity resolver is used during the 
document assembly process to replace the entity references with text particular to the 
instance document. One problem with this approach is that the source document will not 
validate against the original grammar unless the validating parser being used uses that 

15 dedicated entity resolver. 

Because in each of these approaches the document no longer validates against the original 
grammar using a validating XML parser, the ability to manipulate the document with 3 rd 
party XML-aware applications is significantly curtailed. 

20 

It is desired to provide a document assembly method and system, and a method for 
generating a source document for a document assembly system, that ameliorate one or 
more of the above difficulties, or at least provides a useful alternative. 



25 SUMMARY OF THE INVENTION 

In accordance with the present invention there is provided a document generation system, 
including: 

(i) an insertion component for inserting in an initial document one or more 
processing instructions for determining content of an instance document; and 
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(ii) a generation component for generating a source document by inserting in 
said initial document, one or more references to respective logic sources 
external to said source document, said logic sources including information 
for use in conjunction with said one or more processing instructions to 
5 determine content of said instance document, and said source document 

being valid with respect to a predetermined schema. 

The present invention also provides a method for generating a source document for a 
document assembly system, including: 
10 (i) inserting in an initial document one or more processing instructions for 

determining content of an instance document; and m 

(ii) generating said source document by inserting in said initial document, one 
or more references to respective logic sources external to said source document, said logic 
sources including information for use v in -'conjunction with said one or more processing 

1 5 instructions to determine content of said instance document. 

• i • • « 

Preferably, said step of adding one or more processing instructions includes defining one 
or more conditions for determining content of said instance document. 

20 Preferably, said conditions include conditions for determining whether portions of said 
source document will be included in said instance document. 

Preferably, said method includes adding, to a logic source, one or more logic elements 
determining content of an instance document. 

25 

Preferably, said method includes adding, to a logic source, one or more logic elements 
including interview data for defining one or more questions to a user of said document 
assembly system, and for receiving responses to said questions. 

30 Preferably, said step Of adding one or more processing instructions includes defining one 
or more conditions in one or more of said logic sources, and adding one or more 
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processing instructions to said source document associating said conditions with said 
portions. 

Preferably, said step of adding one or more processing instructions includes adding a 
5 processing instruction indicating where content external to said source document can be 
included in said instance document. 

Preferably, said source document includes an extensible markup language (XML) 
document. 

10 

Preferably, said processing instructions include XML processing instructions (Pis). 

Preferably, said processing instruction includes application data that can be parsed as 
XML. 

is ** • ...... 

Preferably, said portions include XML elements. 

Preferably, said source document is valid with respect to a schema. 

20 By restricting said processing instructions to XML processing instructions, the source 
documents can remain valid with respect to their XML schema or grammar. Furthermore, 
the source documents may be edited using standard XML editing tools without affecting 
their ability to function correctly with the document assembly system, provided the XML 
processing instructions are not altered. Moreover, by separating source XML documents 

25 from their associated logic sources, the logic sources can be shared and used by multiple 
source documents. 

Preferably, said one or more references include respective universal resource indicators 
(URIs). 

30 
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Preferably, said logic sources are represented in extensible markup language (XML). 

Preferably, said logic sources are valid with respect to a schema. 

5 Providing a schema for a source document or a logic source simplifies the task of error 
checking and maintaining validity. 

Preferably, said schema for logic source includes a condition element having an attribute 
of type ID. 

10 

The present invention also provides a document assembly system, including: 

an assembler for generating an instance document on the basis of logic sources and 
a source document, said source document including at least one processing instruction and* 
at least one reference to at least one of said logic sources, respectively % said logic; sources 
1 5 being external to said source document. 

The present invention also provides a document assembly method, including: 

accessing a source document including one or more processing instructions and one 
or more references to respective logic sources external to said source document; and 
20 generating an instance document on the basis of said source document and said 

logic sources. 

Preferably, said processing instructions include one or more conditions, and said step of 
generating includes evaluating said one or more conditions to determine content of said 
25 instance document. 

Preferably, said conditions include conditions for determining whether portions of said 
source document will be included in said instance document. 

30 Preferably, each of said logic sources includes one or more logic elements. 
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Preferably, each of said logic sources includes one or more conditions, and said step of 
generating includes evaluating at least one condition to determine whether a corresponding 
portion of the source document will be included in said instance document. 

5 Preferably, said method includes generating interview display data for displaying one or 
more questions to a user of said document assembly system, and for receiving responses to 
said questions. 

Advantageously, each of said logic sources may include one or more references to 
10 respective other logic sources external to said logic source. 

Preferably, logic elements in said logic sources include interview data, and said step of 
generating includes displaying one or more questions to a user and receiving responses to 
said questions on the basis of said interview data. m . 

15 . : ' ; 

Preferably, said step of generating includes generating assembly data on the basis of said 
responses. 

Preferably, said instance document is generated on the basis of said assembly data. 

20 

Preferably, said step of generating includes accessing said logic sources on the basis of 
said references. 

Advantageously, said step of generating may include accessing one or more other source 
25 documents referenced by said source document. 

Advantageously, said step of generating may include including text from a logic source in 
said instance document. 



30 Advantageously, said step of generating may include including user response text in said 
instance document. 
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Advantageously, said logic source may provide phrases to be used in relation to a party, 
where the party has values for facets such as gender or number which match the facets 
specified for that phrase. 

5 

Advantageously, said step of generating may include including an item one or more times 
in said instance document. Preferably, said item may have a different value for each of said 
times. 

10 Preferably, said step of generating includes evaluating a variable associated with textual 
data and including said textual data in said instance document if said variable has a first 
value and omitting said textual data from said instance document if said variable has a 
second value. 

15 Preferably, said variable is a Boolean variable. 

Preferably, said step of generating includes evaluating a variable including the value of 
said variable in said instance document. 

20 Preferably, said one or more references include respective universal resource indicators 
(URIs). 

Preferably, said source document includes an extensible markup language (XML) 
document. 

25 

Preferably, said processing instructions include XML processing instructions (Pis). 

Preferably, said processing instruction includes application data that can be parsed as 
XML. 

30 

Preferably, said logic sources include extensible markup language (XML) logic sources. 
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Preferably, said source document is valid with respect to a schema. 
Preferably, each of said logic sources is valid with respect to a schema. 

5 

The present invention also provides a system having components for executing the steps of 
any one of the above methods. 

The present invention also provides software having program code for executing the steps 
10 of any one of the above methods. 

The present invention also provides a computer readable storage medium having stored 
thereon program code for executing the steps of any one of the above methods. 

... • '"7. 

15 The present invention also provides a document assembly system, including a processing 
engine for generating an instance document from at least one source document and at least 
one logic source referred to in said at least one source document. 

Preferably, said instance document includes an XML instance document. 

20 

Preferably, said system includes a rendering engine for generating from said instance 
document an output instance document for display in at least one output format. 

Preferably, said at least one output format includes at least one of hypertext markup 
25 language (HTML) format, portable document format (PDF), and rich text format (RTF). 

Preferably, said system includes an editor for editing source documents and logic sources 
on the basis of respective schema. 

30 The present invention also provides a source document for a document assembly system, 
said source document including: 
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one or more processing instructions for determining content of an instance 
document; and 

one or more references to respective logic sources external to said source 
document, said logic sources including information for use in conjunction with said one or 
5 more processing instructions to determine content of said instance document. 

Preferably, said processing instructions include one or more conditions for determining 
content of said instance document. 

10 Advantageously, a condition may depend upon one or more other conditions. Preferably, 
said conditions are related by Boolean operators. 

Preferably," said source document includes an extensible markup language {XML) 
document. 

15 

Preferably, said processing instructions include XML processing instructions (Pis). 

Preferably, said processing instruction includes application data that can be parsed as 
XML. 

20 

The present invention also provides a logic source for a document assembly system, said 
logic source including one or more logic elements for determining content of an instance 
document from a source document including a reference to said logic source. 

25 Preferably, said logic elements include the definition of one or more condition elements for 
determining content of said instance document. 

Preferably, said logic source includes interview data for displaying questions to a user and 
for determining responses to said questions. 

30 
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Advantageously, said logic source may include one or more references to respective other 
logic sources external to said logic source, said other logic sources including information 
for use in determining content of said instance document 

5 Advantageously, a reference to a logic source or a source document may include party 
mapping information for mapping a first party used in the referenced source document or 
logic source to a second party used in the referring source document or logic source. 

Advantageously, said source document may include a processing instruction referencing a 
10 second source document for import into said source document. 

Advantageously, said source document may include a processing instruction for including 
in said instance document text from alogici source. 

15 Advantageously, said source document may include a processing instruction for including 
in said instance document response text provided by said user. 

Advantageously, said source document may include a processing instruction for including 
an element one or more times in said instance document. Preferably, said element may 
20 have different contents for each of said times. 

Preferably, said source document includes an extensible markup language (XML) 
document. 

25 Preferably, said processing instructions include XML processing instructions (Pis). 

Preferably, said XML processing instructions include processing instruction data that can 
be parsed as XML. 

30 The present invention also provides a grammar for a logic source for use with a document 
. assembly system, said grammar defining processing instructions for determining content of 



P \OPER\DB\V\spcai legal 



i^^^Li} ptn doc-l March. 2002 ^^^^ 



- 12- 

an instance document generated from a source document including a reference to said logic 
source. 

Preferably, said grammar includes an extensible markup language document type 
5 definition (DTD). 

BRIEF DESCRIPTION OF THE DRAWINGS 

Preferred embodiments of the present invention are hereinafter described, by way of 
example only, with reference to the accompanying drawings, wherein: 

Figure 1 is a block diagram of a preferred embodiment of a document assembly 
1 0 system connected to a remote computer system via a communications network; 

Figure 2 is a block diagram showing document assembly components of the 

system; 

Figure 3 is a block diagram showing components of an editor of the system; 

Figure 4 is a block diagram of a processing engine of the system; 
1 5 Figure 5 is a flow diagram of a document assembly process used by the system; 

Figure 6 is a screenshot of an interview screen displayed by a web browser during 
the document assembly process; 

Figure 7 is a screenshot of a web browser displaying an instance document in 
HTML format generated by the system; 
20 Figure 8 is a schematic diagram showing the relationships between a base source 

document and reusable document components referenced by the base source document; 

Figures 9 and 10 are schematic diagrams illustrating party mapping performed by 
the system; 

Figure 1 1 is a screenshot of a multiple choice question displayed by a web browser 
25 during the document assembly process; 

Figure 12 is a schematic diagram illustrating the development of document 
assembly files using an editor of the document assembly system; and 

Figures 13 to 15 are screenshots of the editor during development of document 
assembly files. 

30 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

In this specification, a reference to a document includes a reference to entries in a file 
system, database or other library, which can be taken together to represent the document. 
A reference to a document includes a reference to a collection of documents. 

5 

As shown in Figure 1, a document assembly system includes a server 102 having a number 
of modules 106 to 112. The document assembly system can be remotely accessed by a 
user of a computer system 114 via a communications network 116, such as a local area 
network or a wide area network such as the Internet. The modules 106 to 1 12 of the 

10 document assembly system include standard processing modules 106, document assembly 
modules 108, and document assembly data files 110. The system also includes an editor 
112 for creating and editing the data files 110. The standard processing modules 106 
include a web server (such as Apache 1 ? 1 ), an HTTPD servlet container 216,. an. XML parser 
206, and an XSLT engine 207. * 

15 • • * 

As shown in Figure 2, the document assembly modules 108 include a Java controller 
servlet 218, a processing engine 202 and a rendering engine 204. The Java controller 
servlet 218 was created using the Java Development Kit available from Sun 
Microsystems®, and provides an interface between the other document assembly modules 

20 and the servlet container 216. A user of the computer system 114 can access the document 
assembly system using a web browser application executing on the computer 1 14. HTTP 
requests generated by the browser are sent over the network 116 and are received by the 
HTTPD servlet container 216. Access to the document assembly modules 108 is provided 
via the Java Servlet 218, which in turn invokes the processing engine 202 to generate an 

25 instance document on the basis of data files 1 10. The instance document is rendered by the 
rendering engine 204 to generate a rendered document which is sent to the user's web 
browser via the Java controller Servlet 218. 

The data files 110 include source documents 208, logic sources 210, document grammars 
30 212, and logic grammars 214. The data files 110 may also include stored response data 
215. In the described embodiment, the server 102 is a standard computer server such as a 
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Sun Fire® 15K server from Sun Microsystems® and executing a Solaris® operating 
system, and the processing modules 106, 108, and 112 of the document assembly system 
are implemented as software modules stored on hard disk storage 104 of the server 102. 
The data sources 110 are data files also stored on the hard disk storage 104. However, it 
5 will be apparent that the modules of the document assembly system can alternatively be 
distributed over a variety of locations, and that at least part of the modules 106, 108, and 
112 can be implemented as dedicated hardware circuits such as application-specific 
integrated circuits (ASICs). 

10 The document assembly system generates instance documents from one or more generic 
source documents 208 on the basis of information provided by some data source - typically 
a user of the system. This information is provided in the form of responses to questions 
generated by the system on the basis of information contained in logic sources 210. The 
document assembly system uses XML as the data format for representing source. 

15 documents 208 and logic sources 210. In addition to elements used for structuring data; 
XML defines a comment and a processing instruction (PI). A processing instruction is not 
part of a document's usual character data, but is instead passed through to an application. 
Thus a processing instruction can be included in a document without affecting its validity. 
The form of a PI can be represented as follows: 

20 PI •<?' PITarget (S (Char* - (Char* '?>■ Char*)))? '?>' 

The information in the processing instruction is not itself checked by a parser at all (apart 
from ensuring it begins with a string PITarget, used to identify the target application for 
the PI). Moreover, the processing instructions may be placed anywhere in the document. 
The document assembly system uses Pis to separate logic used to generate an instance 

25 document from the source document itself. A source document 208 includes references to 
external logic sources 210 that are required to resolve the logic contained in the source 
document 208 itself. 

A source document 208 is a structured XML document, usually including one or more 
30 processing instructions, as described above. Some of these processing instructions include 
logic components that are used to determine the content of instance documents generated 
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from the source document 208. The logic in a source document 208 is resolved using logic 
from one or more external logic sources 210. A logic source required by a source 
document 208 is identified by a reference (in the form of a universal resource indicator 
(URI)) in a processing instruction. The logic sources 210 are themselves structured XML 
5 documents that define questions, conditions, and reusable text objects. The processing 
engine 202 uses the XML parser 206 to read source documents 208 and logic sources 210. 
A logic source 210 can include one or more to references other logic source documents 
210. The source documents 208 are described by document grammars 212, against which s 
the source documents 208 can be validated. Similarly, the logfc sources 210 can be 
1 0 validated against logic grammars 214. 

The processing engine 202 resolves logic associated with an XML source document 208. 
This is achieved by performing one or more question and answer interview rounds that 
determine which components and text from the source documents 208, referenced logic 
1 5 sources 210 and user responses are'included in the resulting instance document. 

A logic source includes XML elements that are used to determine values for variables 
required by a source document or other logic source referring to that logic source. These 
variable values are usually determined from responses provided by a user of the system in 

20 response to questions defined in one or more logic sources 210 referenced by the source 
document, either directly or indirectly, via a chain of references. The process of displaying 
questions to the user and receiving responses to those questions is referred to as an 
interview, and the data defining these questions and possible responses to them is referred 
to as interview data. A given set of responses to a set of questions can be stored as stored 

25 response data 215 on the hard disk 104 of the document assembly system. The user's 
responses are used to assign values to variables, including setting Boolean variables to a 
value of true or false for determining whether an associated portion of text is to be 
included in a generated instance document. These variables values are referred to as 
assembly data. 
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During the course of resolving all the logic necessary for the rendering engine 204 to be 
able to render the document, the processing engine 202 may modify the DOM 
representation of the source XML document, for example, by physically including XML 
fragments to which the source XML document refers. 

5 

After the processing engine 202 has resolved all the logic necessary for the rendering 
engine 204 to be able to render the document in a form suitable for viewing by humans, the 
rendering engine 204 transforms the source XML document (as modified by the processing 
engine 202), omitting elements subject to a condition found to be false, and including 

10 responses to questions where appropriate into the desired instance format. Initially, the 
rendering engine 204 generates an HTML rendering of the instance document that includes 
controls for further processing of the document. Once an instance document has been 
finalised, the rendering engine 204 is used to generate a final rendering of the instance 
document, as described below. 

15 . _ - 

It will be apparent that in order to generate an instance document in the format requested 
by the user (other than XML), it is not necessary to first generate an instance document in 
XML which is valid with respect to the same grammar as the source XML document. For 
desired instance formats other than XML (XML which is valid against the same grammar 

20 as the source XML document), the source XML document (as modified by the processing 
engine 202) is transformed by the rendering engine 204 into the desired instance format via 
several intermediate XML formats. During the course of those transformations, the 
processing instructions are replaced as appropriate with the information obtained during 
the interview process. 

25 

In an alternative embodiment, the processing engine 202 could produce an instance XML 
document (using the same grammar as the source document or a similar one) for the 
rendering engine 204, or generation of that instance XML document could be an early step 
performed by the rendering engine 204 in the course of generating the instance document 
30 in the format desired by the user. 
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Source documents 208 and logic sources 210 are created and maintained using the editor 
1 12, as shown in Figure 3. In the described embodiment, the editor 1 12 is shown as being 
stored locally on the server 102. However, it will be apparent that the editor 112 can 
alternatively be stored and executed on a remotely located computer system and used to 
5 edit document assembly files at that location, providing that the resulting files are 
* accessible by the parser 206 and processing engine 202 of the system. The editor 112 
includes a user interface module 302, an XML editor module 304, a validation module 
306, and a logic module 308. The user interface 302 and XML editor 304 modules allow a 
user of the document assembly system to add, modify, or delete XML elements, attributes, 

10 comments, processing instructions, and text. The XML editor module 304 provides 
standard XML editor functions plus the ability to add and manipulate condition references, 
reusable text, user text, notes and metainformation processing instructions around and 
within document components. If a processing instruction haying some other purpose were 
added to the system, the XML editor module 304 would be altered to add and manipulate 

15 that sort of processing instruction as well. The validation module 306 provides standard 
XML document validation, as well as logic source validation, and validation of 
interactions between source XML documents 208 and logic sources 210. The logic 
module 308 allows the user to define and manipulate questions, conditions and reusable 
text. It also allows the user to define parameter values that are used to determine default 

20 answer to questions, as described below. The editor 112 allows a user to define and 
manipulate logic within the logic sources 210, manipulate XML documents 208, and add 
logic references to a logic source 210 within an XML document 208. The editor parses 
XML elements using the XML parser 206. The editor 112 uses the document grammars 
212 and logic grammars 214 to determine which elements can be added and deleted, and 

25 how they can be manipulated. Grammars are also used as a basis for determining what 
types of logic can be added. For example, reusable text can only be added to elements that 
are allowed to contain text. 

As shown in Figure 4, the processing engine 202 includes an Evaluator module 402, an 
30 evaluable node package 404, a parties package 406, an interview items package 408, and a 
logic sources package 410. 
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The processing engine 202 is coded in the object-oriented programming language Java. 
The Java language specification defines the notion of a package. 

5 The evaluable node package 404 contains classes that represent a Condition and the 
various elements that are allowed within a Condition (eg And, Or, Not, Test and 
UseCondition). Each of these implement an interface called EvaluableNode. That 
interface defines an evaluate method, which can be invoked in order to determine whether 
the value of the evaluable node is true or false. 

10 

The parties package 406 contains a class which represents a Party which is "known" to the 
source document, a Parties class which represents all known parties, and a class which 
represents a PartyDetail. 

1 5 The interview items package 408 contains classes which represent the questions which the 
evaluator module 402 has encountered, and any responses which may have been provided 
or, for non- n key" questions, worked out automatically. These include 
MultipleChoiceQuestions referred to in a Test, UserTextQuestions, and interview items 
encountered in an ArrayRowIterator. 

20 

The logic sources package 410 represents each logic source used directly or indirectly by 
the source document, and their contents, so that when a reference to a piece of logic is 
encountered, the logic can be retrieved efficiently. 

25 The evaluator module 402 is the collection of classes which orchestrate the evaluation of 
the source document. It controls the evaluation processing, represents the state of the 
evaluation and calls the packages described above as necessary. ^ 

The rendering engine 204 includes an XSLT module which is used for transforming XML 
30 documents, a collection of XSL transformations, and several Xalan extensions which are 
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used to manipulate objects referenced by the processing engine 202's evaluator module. 
XSLT and Xalan are available and documented at http://xml.apache>org/xa]an-i/ . 

The processing engine 202 begins processing an input XML document 208 when its 
5 invocation API is called by an invoking process. In the preferred embodiment, the 
invoking process is the Java controller servlet 218 which invokes the processing engine 
202 in response to a request from the user's web browser passed to it from the HTTPD 
servlet container 216. However, the processing engine 202 could alternatively respond to 
requests from other applications or a Web service. 

10 

The invoking process 412 asks the evaluator module 402 to evaluate the source document. 
While there are interview items for which the evaluator module currently needs a response, 
the evaluator module will return those" interview items to the invoking process. It is the 
responsibility of the invoking process to obtain a response for each of those interview 

15 items via some Interviewltem Resolution Mechanism 414. In the preferred embodiment, 
the Interview Item Resolution Mechanism will usually render those interview items as 
inputs on a HTML form in the user's web browser, although in some cases it will resolve 
the interview item in some other way, for example via a query to a database, or by 
invoking a web service. The invoking process then passes the interview items, together 

20 with their responses, back to the evaluator module 402. 

o 

Figure 5 shows the steps followed when creating an instance document from a source 
XML document using a web browser executing on the user's computer 1 14. At step 502, 
the user selects a suitable source XML document from the repository of documents 208 

25 stored on the document assembly system. At step 504, the user answers setup questions for 
one or more subsequent question or interview rounds. The setup questions include 
specifying whether the user wants to be asked all questions during subsequent interviews, 
or just the "key" questions, in which case the system executes a scoring procedure, 
discussed below, to answer the non-"key" questions with the response having the highest 

30 score. If there is more than one Party used in the source XML document, then the user can 
specify which of the parties the user wishes to favour. This is used during subsequent 
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interview rounds to select default responses that suit the selected party, and is also used by 
the scoring procedure that the system uses to answer questions itself. At step 506, 
successive rounds of interview questions are presented to the user until all the questions 
that need to be answered in order for the system to generate an instance document have 
5 been answered. The system uses multiple rounds of interviews in order to limit the 
number of questions asked to those necessary for a particular situation. In some 
documents, e.g., legal documents, it may not be possible to determine whether a particular 
question needs to be asked until answers have been provided for several other questions. 
The system therefore asks the necessary questions in a series of interview rounds, with the 
10 questions for a particular interview round determined from answers provided in one or 
more previous rounds. 

The processing engine 202 provides the questions to which it requires responses to the 
Java controller servlet 218. It-is the responsibility of that servlet to return the responses to 
15 the processing engine 202. ^ Where it gets those responses from is not the concern of the 
processing engine. Where the questions are put to a user via their web browser, the 
controller servlet could put all the questions to the user in a single HTML form, it could 
put the questions to the user one at a time, or it could group them by topic, asking the 
questions in a single topic in a given form. 

20 

For example, Figure 2 is a screenshot of a web browser display during an interview round 
for generating an instance document that is a letter of employment for a prospective 
employee. The display includes two questions for the user. The first is related to the 
employee's expected work hours. A shaded area 602 displays a question presented to the 

25 user in a previous interview round, together with the user's response. The question asked 
was whether the employer permits flexible working hours in relation to the employee. 
Because the answer previously provided was in the affirmative, subsequent questions 604 
are displayed to the user in order to determine the nature of this flexibility. In this example, 
the questions are to determine the employer's normal office hours and the core working 

30 hours in which the employee is expected to be present. The user enters the appropriate 
answers in text boxes 606 displayed under each question 604. 
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In response to the user selecting a button 608 labelled "next", the processing engine 202 
determines, at step 506, whether there are further questions. If there are no further 
questions, the rendering engine 204 generates at step 510 an instance document in HTML 
5 format on the basis of responses provided by the user, which is then displayed by the web 
browser, as shown in Figure 7. 

The parts of the text that are dependent on answers provided by the user are provided as 
hypertext anchors 702, allowing the user to view the questions and responses that resulted 

10 in the insertion of that text in a popup window 704 by moving their mouse over the text. If, 
at step 512, the user selects the hypertext anchor 702, the processing engine 202 generates 
a new interview round including the relevant question, allowing the user to provide a 
different response to that question. Questions dependent on the new response are then 
presented to the user until the processing engine 202 determined that all the necessary. 

15 information has been provided. A new HTML rendering of the instance XML document is 
then generated at step 510 and displayed by the web browser. 

When the user is satisfied with the instance document, one of a number of buttons 706 
included near the top of the HTML instance document can be selected to generate an 

20 output document. The button selected determines the format of the output format.. For 
example, Figure 7 shows a button 706 for .creating an output document in the Adobe® 
portable document format (PDF). However, the document can alternatively generated in 
HTML (without controls), XML, or in rich text format (RTF). Check boxes 708 -are also 
provided in order to allow the user to determine the content of the output document, 

25 whether it will contain the instance document text, editorial notes pertaining to the use of 
the document, and/or the interview questions and answers on which the document was 
based. A further check box 710 allows the user to select whether the output document will 
be write protected. 

30 Controls 712 are also provided for storing responses on the system as stored response data 
215 and for applying stored responses to the source document they came from, or to 
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another source document that uses some of the same logic sources, and for processing 
groups of related documents. 

It is useful to be able to store the response data, so that if a user wishes to modify some of 
5 the answers they have given in an earlier session, they can start by applying the response 
data to the source document, rather than having to answer all the questions again 
themselves. 

The way this works is that when an answer file containing the response data is loaded, the 
10 questions and answers are displayed to the user in an HTML form. After the user has 
modified the responses as appropriate, and submitted the form (ie clicks "next"), those 
interview items are provided to the evaluator module 402. When, in the course of 
evaluating the source document to which those answers are being applied, an interview 
item is encountered, there will be no need to ask the "Java controller servlet for a response 
15 to it, if the response has already been provided via the answer file mechanism. 

In a similar way, the Java controller servlet can invoke the processing engine 202 applying 
a collection of response data to the several source documents comprising a group of related 
documents, and then invoke the rendering engine 204 to display the resulting instance 
20 documents in the user's browser, or save them to disk. It will be apparent that whether 
each document is processed then rendered, or all documents are processed then all 
rendered, depends on the controller servlet implementation (ie how it invokes the 
processing engine 202 and the rendering engine 204). 

25 Thus the document assembly system provides an efficient, yet flexible means to generate 
and modify instance documents for different circumstances. The source documents 208, 
document grammars 212, logic sources 210 and logic grammars 214 will typically be 
created and maintained by an administrator of the system in accordance with the needs of 
the system's users. For example, a legal firm will create precedents for particular 

30 circumstances, e.g., contracts, as source documents 208, together with their associated 
logic sources 210 and grammars 212, 214. The creation and maintenance of these files 110 
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is performed using the editor 112. The actual assembly of these files 110 into instance 
documents is initiated by end-users of the system accessing the system using a web 
browser. 



5 The document assembly system is described in more detail below with reference to 
examples of the data files 1 10 used by the system. An XML source document used by the 
document assembly system for generating instance letters is presented below in its entirety. 
Portions of this source document are then described in detail to illustrate various features 
of the document assembly system. 

10 

<?xm! version=°1.0" encoding= ,, UTF-8 ,t ?> 

<!DOCTYPE Letter SYSTEM , Tile:///home/speedlega!/dtd/xhtm!-letter/dtd/xhtml-Ietter- 
v2.dtd"> . 
<Letter> .. .... 

15 <?SpeedLegai 

<LogicSources> 

<LogicSource ID= u LogicSource_J l, urt^ r /fiteWiogic/LSl WD.Igc M /> 
- <LogicSource ID="LogicSource_2 M uri= , 7fires/logic/LS2WD.Igc M > 
<PartyMapping> 

20 <PartyMap From-'Contractor" To="Employee"> 

<PartyDetaiiMap From=="Shortname n To^'Firstname"^ 
<PartyDetailMap From="Fullname" To= u Firstname7> 
</PartyMap> 
</PartyMapping> 
25 </LogicSource> 
</LogicSources> 

?> 

<?SpeedLegal 

<Metalnformation> 
30 . <rdf:RDF xmlns:dc= n http://pu^l.org/dc/eIen^lents/1.1/ ,, 

xmlns:rdf= n http://www.w3.org/TRAA/D-rdf-syntax-ns#"> 
<rdf:Description> 
<Metadata> 

<dc:Type Genesis="Originar Status- 'Unapproved" 
35 Use="Precedent7> 

<dc:Title>Letter template</dc:Title> 

<dc:Description>Template business Ietter</dc:Description> 

<dc:Subject><Keyword>Letter</Keyword></dc:Subject> 

<dc:Creator> 

40 <Name>SpeedLegal</Name> 

<Organisation>SpeedLegal</Organisation> 
<md.ContactDetails> 

<md.Address>Level 4, 85 Queen Street, Melbourne, 

Victoria</md.Address> 
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<md.Phone>+61 3 9670 0141</md.Phone> 
<md.Email>info@speedlegal.com</md.Erriail> 
<md.Fax>+61 3 9670 0142</md.Fax> 
</md.ContactDetails> 
5 </dc:Creator> 

<RevisionHistory> 

<Revision><dc:Contributor><Name>SpeedLegal</Name><Organisation>SpeedLeg 
al</Organisation><md.ContactDetails><md.Address>Level 4, 85 Queen Street, 
10 Melbourne, Victoria</md.Address><md.Phone>+61 3 9670 

0l41</mdPhone><md.Email>info@speedlegaLcom</md.Email><md.Fax>+61 3 9670 
0142</md,Fax></md.ContactDetails></dc:Contributor><dc:Date>6/2/2002 
1 3:49</dc:Date><What ChangeType^'BrandNew^Letter creation<AA/hat></Revision> 
</RevisionHistory> 
15 </Metadata> 
</rdf:Description> 
</rdf:RDF> 
</Metainformation> 

?> 

20 <LetterHead> : 

<Date/> "* , 
<Salutation/> 
</LetterHead> - . 

<LetterBody> 
25 <p> 

<?SpeedLegal 

<Condition IDREF-'PaidLeave" LogicSource-'LogicSource_17> 

?> 

<?SpeedLegal 

30 <PartyReference IDREF= n Employee" LogicSource^'LogicSource^T 

Style="Pronoun M Type= H PartyDetaiM 7> 

?> shall be eligible for paid leave.</p> 
<p> 

<?SpeedLegal 
35 <Notes> 

<Note Completionlnstruction-'false" ShowExternalUsers-'false" 
UserLevel="Non-Specialist n > 

<NoteTitle>Paid and Unpaid Leave</NoteTitle> 
<NoteBody><p>AII employees can request <b>unpaid</b> 
40 leave</p><p>Only <b>Full-time</b> or <b>Part-time</b> employees who have 
accrued leave can request <b>paid</b> leave. </p></NoteBody> 
</Note> 
</Notes> 

?> 

45 <?SpeedLegal 

<PartyReference IDREF^'Employee" LogicSource= M LogicSource_1 M 
Style= ,, Firstname ,, Type="PartyDetaiL07> 

?> may request <?SpeedLegal <lnsertReusablePhrase 
IDREF-'PaidOrUnpaidLeave" LogicSource=TogicSource_17>?> by filling in a request 
50 form at least one month prior to the earliest date required.</p> 
<p> 
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<?SpeedLegal <Condition IDREF= u AmountOfLeave.negotiate u 
LogicSource="LogicSource.__17>?>The amount of leave is negotiated with the 
employee.</p> , 
<p> 

5 <?SpeedLegal <Condition IDREF-VVmountOfLeave.award" 

LogicSource= u LogicSource_1"/>?>The amount of leave is determined according to the 
relevant award. </p> 

<p>The employee has the following leave entitlements: <object> 
<table> 

10 <tbody> 

<tr> 

<?SpeedLegal <ArrayRowlterator 
repeat= u NumberOfLeaveTypes7>?> 

<td><?SpeedLegal <Array Reference IDREF=°LeaveType7>?> 

15 </td> 

<td><?SpeedLegal <ArrayReference 
IDREF= M LeaveAmount7>?> 
</td> 

</tr> 

20 </tbody> 

</table> *' 
</object> 
</p> 

<?SpeedLegal 

25 <SmartModule uri^'Vhome/jml/termination.sm'^ 

<PartyMapping> 

<PartyMap From- 'Contractor" To= H Employee M > 

<PartyDetailMap From^'Shortname'' To= n Firstname7> 
<PartyDetailMap From= n FulIname" To="Firstname7> 
30 </PartyMap> 
</PartyMapping> 
</SmartModule> 

?> 

</LetterBody> 
35 <LetterTail> 
<Closing/> 
<Sender/> 
</LetterTail> 
</Letter> 

40 

This source document includes several processing instructions (Pis) that begin with 
"<?SpeedLegal" and end with "?>". The data within the processing instruction, Le. 9 between 
these two strings, resembles one or more XML elements. This allows an application to 
45 receive the data and parse it as XML, with the various benefits which that entails. It will 
be apparent to the skilled addressee that this structure is in conflict with the XSLT 1.0 
specification, which specifically prohibits an XSLT processor from constructing PI data in 
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this form. In what follows, it is convenient to refer to the PI data as if it were an XML 
element. 



Some of the Pis in the above source document include within them a LogicSource XML 

element whose purpose is to specify the location or locations, outside of the XML 

document itself, where the system can find the material necessary to process other 

processing instructions encountered within the document, such as the Condition and 

UserText instructions in the above example. For example, a logic source is identified in the 

above example as follows: 

<LogicSources> ... 

<LogicSource ID^'LogicSourceJ" uri= , 7files/logic/LS1WD.Igc M /> 

The LogicSource sub-element of the LogicSources element specifies a universal resource 
indicator (URI) at which the appropriate logic source may be found, and assigns ah ID 
reference name to it. Many of the processing instructions, throughout the example 
document above contain an IDREF reference to one ofrthe ID reference" names of the listed 
logic sources, which is used to identify the URI of the" corresponding logic source. Thus 
the first logic source referred to in the XML document is located at the URI 
7files/logic/LS1 WD.Igc" and is referred to at other places in the source document by the ID 
reference "LogicSource_r. This logic source is provided below. 



<?xml version="1.0" encoding="UTF-8"?> 

<!DOCTYPE LogicFiie PUBLIC "-//SpeedLegal//SpeedLegal Logic for SmartEditor 

1vO//EN" ,, /home/speedlegal/dtd/logic.dtd ,, > 

<LogicFile> 

<Metalnformation> 
<rdf:RDF> 

<rdf:Description> 
<Metadata> 

<dc:Type Genesis= n Original" Status= n Unapproved M Use^Precedent"^ 

<dc:Title>D</dc:Title> 

<dc:Description>[]</dc:Description> 

<dc:Subject> 

<Keyword>[]</Keyword> 
</dc:Subject> 
<dc:Creator> 

<Name>SpeedLegal</Name> 

<Organisation>SpeedLegal</Organisation> 

<md.ContactDetai!s> 
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<md.Address>Level 4, 85 Queen Street, Melbourne, 
Victoria</md.Address> 

<md.Phone>+61 3 9670 0141</md.Phone> 
<md.Email>info@speedlegal.com</md.Email> 
5 <md.Fax>+61 3 9670 0142</md.Fax> 

</md.ContactDetails> 
</dc:Creator> 
<RevisionHistory> 
<Revision> 

10 <dc:Contributor> 

<Name>SpeedLegal</Name> 

<Organisation>SpeedLegaI</Organisation> 

<md.ContactDetails> 

<md.Address>Level 4, 85 Queen Street, Melbourne, 

1 5 Victoria</md.Address> 

<md.Phone>+61 3 9670 0141</md.Phone> 
<md.Email>info@speedlegal.com</md.Email> 
<md.Fax>+61 3 9670 0142</md.Fax> 
</md.ContaetDetails> 
20 </dc:Contributor> 

<dc:Date>5/2/2002 1 1 :4</dc:£>ate> " . .. 
<What ChangeType= ,, BrandNew ,, > metadata</What> 
</Revision> 

</RevisionHistory> . 
25 </Metadata> 
</rdf:Description> 
</rdf:RDF> 
</Metalnformation> 
<PartiesSetup> 
30 <Party Assessable="true" ID="Employee"> 

<DisplayName>Employee</DisplayName> 

<Role>The person to be offered a position under this letter. </Role> 
<PartyDetails> 

<Party Detail ID="PartyDetaiL0" Name="Firstname M > 
35 <Value Gender="Unspecified H Number="Unspecified"> 

<lnsertUserText IDREF= M EmployeeFirstname7> 
<A/alue> 
</PartyDetail> 

<PartyDetail ID="PartyDetaiM" Name="Pronoun"> 
40 <Value Gender= H Unspecified" Number= M Singular n >He<A/alue> 

<Value Gender-'Female" Number= n Singular ,, >She<A/alue> 
<Value Gender="Unspecified" Number= ,, Plural ,, >They<A/alue> 
</PartyDetail> 
</PartyDetails> 
45 </Party> 

<Party Assessable= ,, true H ID= n Employer n > 
'<DisplayName>Employer</DisplayName> 

<Role>The person who will offer to engage the employee under this 
letter. </Role> 
50 <PartyDetails> 

<PartyDetail ID="PartyDetaiL2 M Name= ,, Name H > 
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<Value Gender= ,, Unspecified u 
Nurnber= M Unspecified">SpeedLegal<A/alue> 
</PartyDetail> 
</PartyDetails> 
5 </Party> 

</PartiesSetup> 

<LogicSources/> 

<LogicSetup> 

<UserTextQuestion Columns= n 20 M ID="LetterSubject M Rows="1" Type= t Text M > 
10 <Question>What is the subject of this letter?</Question> 

<Topic>Subject of Letter</Topic> 
</UserTextQuestion> 

<MultipleChoiceQuestion ID= n Emp!oymentJype n > 
<QuestionType KeyQuestion= n true'7> 
15 <Question>On what basis will <Party Reference IDREF-'Employee" 

Style="Firstname n Type= M PartyDetaiL07> be emp!oyed?</Question> 
<Topic>Position and duties</T opic> 

<SelectionRules AnswerUsing^Defaulf Cardinality-'Single" 
Device= h Checkbox7> 
20 <Responses> '•* Z 

<Response> 

<SelectionCriteria Default= n Checked7> " 
•<Prompt>On a full time basis</Pro"mpt> ; -~ ■ ■ •-■">.•!(:-- 
<SetValueTo>FullTime</SetValueTo> 
25 <Notes> 

<Note Completionlnstruction= ,, false H ShowExternalUsers-'false" 
UserLevel="Non-Specialist"> 

<NoteTitle>Full Time Employees</NoteTitle> 
<NoteBody> 

30 <p>Full time employees are required to work at least the 

hours specified in the award</p> 

<p>They are elligible for maternity/paternity leave as well as 
annual and sick leave</p> 

</NoteBody> 
35 . </Note> 

</Notes> 
</Response> 
<Response> 

<SelectionCriteria Default="Unchecked'7> 
40 <Prompt>On a part time basis</Prompt> 

<SetValueTo>PartTime</SetValueTo> 
</Response> 
<Response> 

<SelectionCriteria Default= M Unchecked7> 
45 <Prompt>On a casual basis</Prompt> 

<SetValueTo>Casual</SetValueTo> 
</Response> 
</Responses> 
</MultipleChoiceQuestion> 
50 <Condition ID="EmploymentJype.FullTime n > 

<Test IDREF^'EmploymentJype" Value= ,, FullTime , 7> 
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</Condition> 

<Condition ID= ,, EmploymentJype.PartTime ,, > 

<TestlDREF= M EmploymenUype n Value="PartTime7> 
</Condition> 

5 <Condition ID="Employment_type.Casuar> 

<Test IDREF= n EmploymentJype M Value="Casual7> 
</Condition> 

<UserTextQuestion Columns="20" ID="Commencement n Rows= ,, 1" 
Type='Text n > 

10 <Question>On what date will <PartyReference IDREF-'Employee" 

Style= n Firstname" Type= n PartyDetaiLO ,, />&apos;s employment 
commence?</Question> 

<Topic>Position</Topic> 
</UserTextQuestion> 

15 <UserTextQuestion Columns- '20 M ID= n NumberQfLeaveTypes M Rows="1 " 

Type= M PositiveInteger n > 

<Question>How many distinct types of leave is the employee entitled 
to?</Question> 

<Topic>Leave TabIe</Topic> 
20 </UserTextQuestion> 

<UserTextQuestion Coiumns= ,, 20" !D= u Leavefype" Rows='T Type= , Text n >. . ■ 
<Question>Type of Leave?</Question> 
<Topic>Leave Table</Topic> - ; - 
</UserTextQuestion> ' - 

25 <UserTextQuestion Columns="20" ID^LeaveAmount" Rows="1 " Type= , Text"> 

<Question>Number of Weeks?</Question> 
<Topic>Leave Table</Topic> 
</UserTextQuestion> 
<Condition ID="PaidLeave"> 
30 <Or> 

<UseCondition IDREF= !, Employment_type.FullTime7> 
<UseCondition IDREF= M Emp!oyment_typePartTime"/> 
</Or> 
</Condition> 

35 <UserTextQuestion Columns="20 u ID="EmployeeFirstname n Rows="1 " 

Type^'Text'^ 

<Question>Please type the <PartyReference IDREF= M Employee n 
Style= ,, Firstname 1 ' Type="PartyDetail_07>&apos;s first name</Question> 
<Topic>Employee Setup</Topic> 
40 </UserTextQuestion> 

<ReusablePhrase ID^'PaidOrUnpaidLeave'^unpaid <ConditionalPhrase 
Condition= M PaidLeave n >or paid</ConditionalPhrase> leave</Reusab!ePhrase> 
<MultipleChoiceQuestion ID= n AmountOfLeave M > 
<QuestionType KeyQuestion= M false7> 
45 <Question>How much leave is <PartyReference IDREF-'Employee" 

Style= n Firstname M Type="PartyDetaiL07> entitled to?</Question> 
<Topic>Setup</Topic> 

<SelectionRules AnswerUsing= t, Bias n Cardinality-'Single" 
Device="Checkbox7> 
50 <Responses> 
<Response> 
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<SelectionCriteria Default="Unchecked"> 

<PartyAssessment Assessment-Best" IDREF="EmpIoyer7> 
</SelectionCriteria> 

<Prompt>Award entitlements</Prompt> 
5 <SetValueTo>award</SetValueTo> 
</Response> 
<Response> 

<SelectionCriteria Default-'Unchecked'^ 

<PartyAssessment Assessment-'Worst" IDREF= n Employer7> 
10 </SelectionCriteria> 

<Prompt>By negotiation with <PartyReference IDREF-'Employer 1 ' 
Style= M Name" Type="PartyDetail_27> 
</Prompt> 

<SetValueTo>negotiate</SetValueTo> 
15 </Response> 
</Responses> 
</MultipleChoiceQuestion> 
<Condition ID-'AmountOfLeave.award n > 

<Test IDREF= M AmountOfLeave n Value="award7> 
20 </Condition> 

<Condition ID= u AmountOfLeave.negotiate"> 

<Test IDREF= n AmountOa.eave M VaIue= M negotidte7> 
</Condition> / 
</LogicSetup> 
25 </LogicFile> 



Because the information necessary to process processing instructions encountered within 
the document are stored outside the document, this information can be stored in an XML 
30 format and can therefore be validated against a DTD. The DTD used by the document 
assembly system for validating logic sources 210 is as follows: 



<!ELEMENT LogicFile (Metalnformation, PartiesSetup, LogicSources, LogicSetup)> 

35 

<!ELEMENT LogicSources (LogicSource)*> 
<!ELEMENT LogicSource (PartyMapping)> 
<IATTLIST LogicSource 
uri CDATA #REQUIRED 

40 > 

<!ELEMENT PartyMapping (PartyMap*)> 
<!ELEMENT PartyMap (PartyDetailMap)+> 
<!ATTLIST PartyMap 

From NMTOKEN #REQUIRED 
45 To ID #REQUIRED 

> 

<!ELEMENT PartyDetailMap EMPTY> 
<!ATTLIST PartyDetailMap 
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From NMTOKEN #REQUIRED 
To ID #REQUIRED 

> 



<!ENTITY % HtmlParagraphMarkup "b | i J a "> 

<!ENTITY % MappablePCDATA "(#PCDATA | PartyReference)*"> 

<!ENTITY % SmartsPhraseLevelDeclarations " ReusablePhrase | UserTextQuestion "> 

<!ENTITY % SmartsPhraseLevellnsertions "| InsertReusablePhrase | 

ConditionalPhrase | lnsertUserText"> 

<!ENTITY % SmartsConditions " Condition IDREF #IMPLIED "> 
<!ENTITY % SmartsldRequired "ID ID #REQUIRED"> 
<!ENTITY % Smartsldlmplied "ID ID #IMPLIED"> 
<!ENTITY % SmartslDREF "IDREF IDREF #REQUIRED "> 
<!ENTITY % SmartlDREF "IDREF IDREF #REQUIRED "> 

<!ENTITY % SmartsPCDATA "(#PCDATA | InsertUserText | InsertReusablePhrase I 
ConditionalPhrase )*"> 

<!ELEMENT Metalnformation (rdf:RDF)> 

<!ELEMENT rdf:RDF (rdf:Description)> 

<!ATTLIST rdftRDF 

xmlns.rdf CDATA #FIXED "http://www.w3.Org/TR/WD-rdf-syntax-ns#" - • 
xmlns:dc CDATA#FIXED "http://purl.Org/dc/elements/1.1/" - " ■. 

<!ELEMENT rdf: Description (Metadata)> . ' \- . 

<!ELEMENT Metadata (dc:Type, dc:Title, dc.Source?^ ::dc:Description, dc:Subject, 
dc:Creator, dc:Publisher?, dc:Relation*. RevisionHistory, PlannedEnhancements'?)> 
<!ELEMENT dc:Type EMPTY> 

<!ATTLIST dc:Type 

Genesis (Original | Copy | Link) "Original" 
Use (Precedent | Transaction) "Precedent" 
Status (Unapproved | Approved) "Unapproved" 

> 

<!ELEMENT dc:Title (#PCDATA)> 
<!ELEMENT dc:Source (#PCDATA)> 
<!ELEMENT dc:Description (#PCDATA)> 
<!ELEMENT dc:Subject (Keyword*)> 
<!ELEMENT Keyword (#PCDATA)> 

<!ELEMENT dc:Creator (Name, Organisation, md.ContactDetails)> 
<!ELEMENT Name (#PCDATA)> 
<!ELEMENT Organisation (#PCDATA)> 

<!ELEMENT md.ContactDetails (md.Address | md.Phone I md.Email I md.Fax)+> 
<!ELEMENT md.Address (#PCDATA)> 
<!ELEMENT md.Phone (#PCDATA)> 
<!ELEMENT md.Email (#PCDATA)> 
<!ELEMENT md.Fax (#PCDATA)> 

<!ELEMENT dc:Publisher (Name, Organisation, md.ContactDetails)> 

<!ELEMENT dc:Relation (UsedBy?, MatterNumber?, Client?, Other*)> 

<!ELEMENT UsedBy (#PCDATA)> 

<! ELEMENT MatterNumber (#PCDATA)> 

<!ELEMENT Client (#PCDATA)> 



P:\OPER\D BWVspcaflcssJ^BEmbly prov.dcc-1 March. Tirol 



-32- 



<!ELEMENT Other (Field, FieldValue)> 
<!ELEMENT Field (#PCDATA)> 
<!ELEMENT FieldValue (#PCDATA)> 
<!ELEMENT PlannedEnhancements (Enhancement*)> 
5 <!ELEMENT Enhancement (dc: Contributor, dc:Date, What)> 
<!ELEMENT RevisionHistory (Revision*)> 
<!ELEMENT Revision (dc: Contributor, dc:Date, What)> 
<!ELEMENT dc:Contributor (Name, Organisation, md.ContactDetails)> 
<!ELEMENT dc:Date (#PCDATA)> 
10 <!ELEMENT What (#PCDATA)> 
<!ATTLIST What 

ChangeType (MajorRework | MinorChanges | BrandNew | ExtraContent | Update) 
"BrandNew" 
> 

15 <!ELEMENT PartiesSetup (Party*)> 

<!ELEMENT Party (DisplayName, Role, PartyDetails)> 
<!ATTLIST Party 

%SmartsldRequired; 
Assessable (true | false) "true" 

20 > 

<!ELEMENT DisplayName (#PCDATA)> 

<!ELEMENT Role %MappablePCDATA;> . .. 3 - 
<!ELEMENT PartyDetails ( Party Detail*) 5 " 
<!ELEMENT PartyDetail (Value+)> 
25 <!ATTLIST PartyDetail 

Name NMTOKEN #REQUIRED 
%SmartsldRequired; 

> 

<!ELEMENT Value %SmartsPCDATA;> 
30 <!ATTLIST Value 

Gender (Male | Female | Neuter | Unspecified) "Unspecified" 
Number (Singular | Plural | Unspecified) "Unspecified" 

> 

<!ELEMENT PartyReference EMPTY> 
35 <!ATTLIST PartyReference 
%SmartlDREF; 
Type IDREF #REQUIRED 
Style NMTOKEN #IMPLIED 

> 

40 <!ELEMENT LogicSetup (Condition | MultipleChoiceQuestion | 
%SmartsPhraseLevelDeclarations;)*> 

<!ENTITY % booleanSubstitute " Test | UseCondition j And | Or | Not "> 
<!ELEMENT Condition (True | False | %booleanSubstitute;)> 
<!ATTLIST Condition 
45 %SmartsldRequired; 
> 

<!ELEMENT True EMPTY> 
<!ELEMENT False EMPTY> 
<!ELEMENT Test EMPTY> 
50 <!ATTLIST Test 

uri CDATA #IMPLIED 
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%SmartslDREF; 

Value CDATA #REQUIRED 

> 

<!ELEMENT And ((%booieanSubstitute;), (%boo!eanSubstitute;))> 
5 <!ELEMENT Or ((%booleanSubstitute;), (%booleanSubstitute;))> 
<!ELEMENT Not (%booleanSubstitute;)> 
<!ELEMENT UseCondition EMPTY> 
<!ATTLIST UseCondition 
uri CDATA #IMPLIED 
10 %SmartslDREF; 
> 

<!ELEMENT Question %MappabIePCDATA;> 
<!ELEMENT Topic %MappabiePCDATA;> 

15 <!ELEMENT . MultipleChoiceQuestion (QuestionType, Question, Topic, Notes?, 
SelectionRules, Responses)> 
<!ATTLIST MultipleChoiceQuestion 
%Smarj:sldRequired; 

> 

20 <!ELEMENT QuestionType EMPTY> 

<!ATTLIST QuestionType ^ _ . ' 

KeyQuestion (true | false) "true" : "~ • " - C - 

<IELEMENT Responses (Response, Response+)> 
25 <!ELEMENT SelectionRules EMPTY> 
<!ATTLIST SelectionRules 

Device (Checkbox | DropdownList) "Checkbox" 

AnswerUsing (Default | Bias) "Bias" 

Cardinality (Single | Multiple) "Single" 

30 > 

<!ELEMENT Response (SelectionCriteria, Prompt, SetValueTo, Notes?)> 
<!ELEMENT SelectionCriteria (PartyAssessment)*> 
<!ATTLIST SelectionCriteria 

Default (Checked | Unchecked) "Unchecked" 

35 > 

<!ELEMENT PartyAssessment EMPTY> 
<!ATTLIST PartyAssessment 
%SmartslDREF; 

Assessment (Worst | Bad | Neutral | Good | Best) "Neutral" 

40 > 

<!ELEMENT Prompt %MappablePCDATA;> 
<IELEMENT SetValueTo (#PCDATA)> 



45 <IELEMENT UserTextQuestion (Question, Topic, DefaultText?, Notes?)> 
<!ATTLIST UserTextQuestion 
%SmartsldRequired; 

Columns (1 | 2 | 3 | 4 | 5 | 10 1 15 | 20 | 25 | 30 | 40 | 50 | 60 | 70) "20" 
Rows (1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 12 | 14 | 15 | 16 | 18 | 20 | 25 | 30) "1" 
50 Type (Text | Duration | Time | Date | Decimal | Integer) "Text" 

> 
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<!ELEMENT DefaultText (#PCDATA)> 
<!ELEMENT InsertUserText EMPTY> 
<!ATTLIST InsertUserText 
%SmartlDREF; 

5 > 

<!ELEMENT ReusablePhrase (#PCDATA %SmartsPhrasel_evellnsertions; )*> 
<!ATTLIST ReusablePhrase 
%SmartsldRequired; 

10 > 

<!ELEMENT InsertReusablePhrase EMPTY> 
<!ATTLIST InsertReusablePhrase 
%SmartlDREF; 

> 

15 <!ELEMENT ConditionalPhrase (#PCDATA | PartyReference 
%SmartsPhrasel_evellnsertions; )*> 
<!ATTLIST ConditionalPhrase 
%SmartsConditions; 

> 

20 

<!ELEMENT Notes (Note)*> 
<!ELEMENT Note (NoteTitle, NoteBod*y)> 
<!ATTLIST Note 

UserLevel (Specialist | Non-Specialist) "Non-Specialist" 
25 ShowExternalUsers (true | false) "false" 

Completionlnstruction (true | false) "false" 

> 

<!ELEMENT NoteTitle (#PCDATA)> 

<!ELEMENT NoteBody (p)+> 
30 <!ELEMENT p (#PCDATA | %HtmiParagraphMarkup;)*> 

<!ELEMENT b (#PCDATA)> 

<!ELEMENT i (#PCDATA)> 

<!ELEMENT a (#PCDATA)> 

<!ATTLIST a 
35 href CDATA #REQUIRED 

target CDATA #FIXED "_blank"> 



Another advantage of separating logic source files 210 from document source files 208 is 
40 that that the logic source files 210 can also be re-used in conjunction with other XML 
documents, simplifying maintenance. For similar reasons, it is convenient to store logic 
information in separate files rather than, for example, one monolithic logic file. This 
allows related information to be grouped together, and only the logic components that are 
required need to be re-used with a given source document. 

45 
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It will be apparent that this information could alternatively be stored in a database rather 
than separate XML files. Separate XML files are preferred because they can be easily 
arranged in a hierarchical file system. However, if a logic source which is in use is moved, 
then a base XML document that refers to the logic source will include an incorrect URI, 
5 and will need to be updated. In an alternative embodiment, the repository 208 in which the 
source documents are stored detects the fact that the logic source has been moved, and 
updates the references to that logic source automatically, or alternatively, advises a URI 
resolver (the purpose of which is to translate old locations to actual locations) of the new 
location. This approach can be implemented with file systems where extra code can be 
10 executed automatically when a file is copied. Many WebDAV server implementations are 
provided with source code, which would enable a document assembly system which used a 
webDAV server as its repository 208 to implement this. 

Another alternative would be to;have a single resource in the document assembly system 
15 which authoritatively associated the name of a logic source Avith its location, so that in a 
reference to a logic source, a source document contained only the name of the resource 
known to the system (together with the party mapping being used, as to which see further 
below). In this implementation, that single resource would need to be updated when a 
logic source was moved. 

20 

A logic source can refer to one or more other logic sources. For example, Figure 8 shows a 
first logic source 802 that builds additional logic on top of that available in second 804 and 
third 806 logic sources. The third logic source 806, in turn, builds additional logic on top 
of a fourth logic source 808. However, the base XML document 812 only refers to the first 

25 802 and second 804 logic sources. Thus a convention is needed as to the "visibility" of 
logic. The convention used by the preferred embodiment is that the only logic sources 
visible in a particular document (either an XML source document or a logic source) are 
those declared in that documents <LogicSources> element. Consequently, if the base 
XML document 812 needed to use a condition defined in the third logic source 806, it 

30 would need to import that logic source in its <LogicSources> element. This would be 
achieved by an administrator editing the source document using the editor 1 12. 
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Figure 8 also shows the base XML document 812 reusing an XML fragment 814 by 
reference. A "SmartModule 11 processing instruction is used for this purpose because use in 
the source document of the standard XML inclusion mechanism Xinclude, documented at 
www.w3.org/TR/xinclude/, would invalidate the source document unless the grammar for 
the source document specifically allowed its use. In Figure 8, the XML fragment 814 
imports a fifth logic source 816. The latter is not visible to the base XML document 812. 

Assembled documents often refer to one or more parties. For example, a legal document 
might refer to the parties involved in an agreement of some kind. In such cases there are 
often only two parties involved, but in some circumstances there may be many parties. A 
logic source specifies the parties known to it, in its PartiesSetup element. 

A party mapping is a child on a logic source import that allows parties used in a logic 
source to be mapped to parties used in a document importing the- lpgic source. This makes . 
it possible for a logic source which, for example, refers to a Plumber and a. Homeowner, to 
be used in a document between a Banker and a Customer. 

A party mapping chain is a chain of party mappings between a base XML document and a 
logic source. It will be evident that a party mapping chain can swap two parties around 
e.g., if the LogicSource import statement in the base XML document 812 for the first logic 
source 802 said, in effect, "Employer becomes Employee and Employee becomes 
Employer". Thus two references to the same physical logic source are regarded as logically 
the same if and only if their respective party mapping chains result in the same party 
mapping. 

It is not an error to import the same logic source twice in a single LogicSources statement 
(allocating each occurrence a distinct ID). This allows one physical question or condition 
to be used for two logical questions by defining different party mappings in each 
LogicSource import statement. 

In addition to those described above, the example source document above includes seven 
other types of processing instruction: 
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(i) Condition 

(ii) InsertReusablePhrase 

(iii) InsertUserText 

(iv) PartyReference 

5 (v) ArrayRowlterator 

(vi) SmartModuIe 

0 

(vii) Notes 

(viii) Metal nformation 

10 For half of these, there is a correspondence between a processing instruction in a source 
XML document, and an element in a referenced logic source. That is, when software 
manipulates or interprets the processing instruction, it does so in conjunction with the 
corresponding element in the logic source. This correspondence is as follows: 



Processing Instruction 


LogicSource 


Condition 


Condition 


InsertReusablePhrase 


ReusablePhrase 


InsertUserText 


UserTextQuestion 


PartyReference 


Party 



15 

ArrayRowlterator, SmartModuIe, Notes and Metal nformation are exceptions to this - an 
ArrayRowlterator is independent, as is Notes and Metal nformation, while a SmartModuIe 
has a correspondence with a fragment of XML stored at some other URL 

20 A Condition processing instruction specifies by convention that its parent element should 
only be included in the instance document if the associated condition evaluates to true. For 
example, the following condition is taken from the example source document given above. 
<p> 

<?SpeedLegal <Condition IDREF= H AmountOfLeave.negotiate n 
25 LogicSource="LogicSource_17>?>The amount of leave is negotiated with the 

employee.</p> 

This means that the <p> element will only be included in an instance document generated 
from this source document if the Condition AmountOfLeave. negotiate in the logic source 
30 named "LogicSource_r is true. It will be apparent that alternative embodiments could be 
developed to implement the condition is other ways. For example, the Condition could by 
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convention apply to the following sibling, or Condition open and Condition close 
processing instructions could be used to surround the XML to which the condition is to 
apply. However, neither of these approaches is preferred. In the first case, there may be 
no following sibling. In the second case, if the open and close processing instructions are 
5 not properly nested so that they precede and follow, respectively, a matching pair of tags, 
then the resulting instance document would not be well-formed. Similarly, the syntax of 
the Condition PI could have been written in various ways, some of which include data that 
looks like XML, and others that do not, including: 

10 (i) <?SpeedLegal <Condition IDREF= ,, AmountOfLeave.negotiate H 
LogicSource="LogicSource_1 "/>?> 

(ii) <?SpeedLegal <Logic type="condition" IDREF= n AmountOfLegve.negotiate n 
LogicSource="LogicSource_1 "/>?> 

(iii) <?Condition <data IDREF="AmountOfLeave.negotiate ,, _ " 
15 LogicSource= n LogicSource_J7>?> 

(iv) <?Condition IDREF^AmountOfLeave.negotiate" 
LogicSource="LogicSource_T?> ./[ _ - ; _ _ ; 

(v) <?Condition IDREF= M AmountOfLeave.negotiate"; ■ 
LogicSource="LogicSource_1 "?> 

20 (vi) <?Condition LogicSource_1#AmountOf Leave. negotiate"?> 
However, the first of these is the form used in the preferred embodiment. 

The IDREF attribute of a condition PI specifies the ID of the condition to be tested; the 
25 LogicSource attribute is an IDREF to the logic source in which the condition is defined. 
For example, the logic source file with an IDREF of "LogicSource_l H and listed above 
contains the definition of the AmountOfLeave.negotiate condition, as follows: 



<Condition ID="AmountOfl_eave. negotiate 1 ^ 
30 <Test IDREF^'AmountOfLeave" Value= n negotiate7> 

</Condition> 

This means that the condition will evaluate to true if the response given for the question 
named (JLe. 9 with an IDREF equal to) "AmountOfLeave" has the value "negotiate". The 
35 question "AmountOf Leave" is a multiple choice question defined in the logic source, as 
follows: 
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<MultipleChoiceQuestion ID="AmountOfLeave ,, > 
<QuestionType KeyQuestion="false7> 

<Question>How much leave is <PartyReference IDREF="EmpIoyee M 
5 Style= ,, Firstname ,, Type= M PartyDetaiLO ! V> entitled to?</Question> 

<Topic>Setup</Topic> 

<SelectionRules AnswerUsing="Bias" Cardinality="Single" 

Device="Checkbox7> 
<Responses> 
10 <Response> 

<SelectionCriteria Default="Unchecked"> 

<PartyAssessment Assessment= M Best ,, IDREF="Employer7> 
</SelectionCriteria> 

<Prompt>Award entitlements</Prompt> 
15 <SetValueTo>award</SetValueTo> ■ 

</Response> 
<Response> 

<SelectionCriteria Default="Unchecked"> 

<PartyAssessment Atssessment^Worst' IDREF=s"Employer7> 
20 </SelectionCriteria> 

<Prompt>By negotiation with' <PartyReference IDREF="Employer n 

Style= ,, N^me M Type="PartyDetaiL27> 
</Prompt> '* ...... 

<SetValueTo>negotiate</SetValueTo> 
25 </Response> 
</Responses> 
</MultipleChoiceQuestion> 



30 In this example, the MultipleChoiceQuestion element's QuestionType child has value 
"false" for attribute KeyQuestion. This means that the processing engine 202 is free to 
attempt to answer this question itself (i.e., without interacting with any external agent) 
using its scoring procedure if the user or software invoking the Processing Engine 202 
indicated they only wished to be asked Key Questions. 

35 

If the SelectionRules element has value "Bias" for the AnswerUsing attribute, as is the case 
here, then the processing engine 202 scores each response based on the values contained in 
its SelectionCriteria child and which party the user or software invoking the Processing 
Engine indicated they wish to favour. Alternatively, if the SelectionRules element has 
40 value "Default" for the AnswerUsing attribute, then the processing engine 202 selects the 
response for which the SelectionCriteria element's "Default" attribute has value "Checked". 
For example, Figure 1 1 shows a screenshot of a web browser window displaying this 
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question in HTML generated by the Java Controller servlet 218 of the document assembly 
system. 

In the above example, the MultipleChoiceQuestion is defined in the logic source that also 
5 defines the condition. However, a definition in a different logic source can be used, 
assuming that different logic source was declared in the LogicSources element (of the 
logic source which defines the condition), by including its IDREF name (z.e., the name 
assigned to it in the logic sources element) in the Test element, as follows: 

10 <Test IDREF-' AmountOfLeave " Value= M negotiate " LogicSource="LogicSource804" /> 

A condition can be defined in terms of other conditions, using boolean logic operators. In 
the example, the Condition PaidLeave is defined as follows: 

15 

<Condition ID=''PaidLeave n > 
<Or> 

<UseCondition IDREF^'EmploymentJype.FullTime'V^ 
<UseCondition IDREF= n EmploymentJype.Parrnme7> 
20 </Or> 

</Condition> 

This condition will be true if either the condition EmploymenMype.FulITime is true or the 
condition EmploymenMype.PartTime is true. 

25 

It will be evident that condition Pis as described above can only be attached to elements. 
The question arises - what if you wish to attach a condition to some text which is not 
tagged (or you do not wish to tag, or because of the grammar, cannot tag) with an element 
to which a condition can be attached. There are several possible solutions to this. One 
30 possibility would be to define opening and closing Condition Pis that surround the text in 
question. However, this approach has drawbacks already adverted to above. In the 
preferred embodiment, the text is defined in a logic source, and an "InsertReusablePhrase" 
processing instruction is used to refer to it in the source document. 

35 There is an InsertReusablePhrase in the example XML document above: 
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<p>..may request <?SpeedLegal <lnsertReusablePhrase 

IDREF="PaidOrUnpaidLeave ,r LogicSource="LogicSource_17>?> by filling in a request 
form at least one month prior to the earliest date required.</p> 

5 The corresponding ReusablePhrase with ID "PaidOrllnpaidLeave" is defined in the logic 
source as follows: 

<ReusablePhrase ID= n PaidOrUnpaidLeave">unpaid <ConditionalPhrase 
Condition="PaidLeave">or paid</ConditionalPhrase> leave</ReusablePhrase> 
10 * 

Thus a ReusablePhrase can contain not only text, but other things which are evaluated. 
One of these is a "ConditionalPhrase". This is an element which is defined in the logic 
source DTD, mainly for the purpose of allowing a condition to be attached. In this way a 
ReusablePhrase can be used to conditionally insert text into the XML document in the 
15 absence of an element to which a condition could otherwise be attached. In alternative 
embodiments, a condition could be attached directly to the ReusablePhrase element as an 
attribute, or a new processing instruction defined which specifies both a condition and the 
text to be inserted if the condition is true. 



20 Information entered by a user in response to a question can also be inserted into an 
instance document. For example, the source document listed above includes the following 
processing instruction: 

<?Speedl_egal 

25 <lnsertUserText IDREF^'LetterSubject" LogicSource="LogicSource_17> 

?> 

The rendering engine 204 processes this instruction by inserting, in the instance document, 
the value (/.<?., the response provided by the user) of the user text question with ID 
30 "LetterSubject" that is defined in the logic source "LogicSource_1". 

That user text question is defined in the logic source as follows: 

<UserTextQuestion Columns="20" ID= n LetterSubject" Rows='T' Type='Text"> 
35 <Question>VYhat is the subject of this letter?</Question> 
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<Topic>Subject of Letter</T opic> 
</UserTextQuestion> 

The type attribute restricts the allowed type of input to the specified format, which is one 
5 of text, duration, time, date, decimal, or integer. In this example, the user may enter free 
text. 

In the preferred embodiment, MultipleChoiceQuestion and UserTextQuestion are distinct 
elements, even though their content shares some common characteristics (e.g., a question). 
10 In an alternative embodiment, the common characteristics are abstracted out in the DTD, 
as follows: 

<!ELEMENT Interviewltem (question, Topic, Notes?, ( MultipleChoiceQuestion | 

UserTextQuestion )) > ' ! " 

15 " - ' - 

Indeed, the Interviewltems package 408 of the Processing- Engine 202 of the : preferred 

embodiment uses just this model - an abstract class called Interviewltem captures the 

commonalities between MultipleChoiceQuestion, UserTextQuestion, etc, with subclasses 

that represent the specifics of each case. 

20 

Moreover, although UserTextQuestion is implemented with a type attribute in the 
preferred embodiment, the different types could alternatively be implemented as elements 
in their own right, in which case the Interviewltem element declaration might look like: 

25 <!ELEMENT Interviewltem (question, Topic, Notes?, ( MultipleChoiceQuestion | 
UserTextQuestion | Duration | Time | Date | Decimal | Integer )) > 



A party reference processing instruction is used by the processing engine 202 to generate 
30 text referring to a party. For example, the following: 

<?SpeedLegal 

<PartyReference IDREF^'Employee 11 LogicSource= ,, LogicSource_1 n 
Style= n Pronoun" Type="PartyDetaiM 7> 

35 ?> 
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in a source XML document will result in the insertion, into the instance document, of the 
value of the Party Detail whose ID corresponds to the Type attribute for that party. In this 
example, the relevant party is "Employee". 

5 

<Party Assessable="true" ID="Employee"> 
<DisplayName>Employee</DisplayName> 

<Role>The person to be offered a position under this letter. </Role> 
<PartyDetails> 

10 <PartyDetail ID="PartyDetaiLP" Name="Firstname"> 

<Value Gender="Unspecified" Number="Unspec1fied ,, > 

<lnsertUserText IDREF="Emp!oyeeFirstname7> 
<A/alue> 
</PartyDetail> 

15 <Party Detail ID^'PartyDetaiM" Name="Pronoun"> _ : 

<Value Gender="Unspecified" Number="Singular">He<A/alue> 
<Value Gender-'Female" Number="SinguIar">She</Value> 
<Value Gender="Unspecified" Number="Plural">They<A/alue> 
</PartyDetail> 
20 </PartyDetails> 
</Party> 

It can be seen from this example that the PartyDetail can contain values for different 
Gender and Number. Assuming the Gender and Number of the party has been ascertained 
25 (in this implementation a question .to do so is automatically generated by the processing 
engine 202 when required), the corresponding value can be evaluated by the Processing 
Engine 202, and inserted by the Rendering Engine 204, as described below. 

It will be evident that other attributes could be included on the PartyDetail Value children, 
30 for example, "Formal", which would be useful with some languages other than English. 



The example document contains a processing instruction specifying an ArrayRowlterator: 
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<tr> 

<?SpeedLegal <ArrayRowlterator repeat= ,, NumberOfLeaveTypes'7>?> 
<td><?SpeedLegaI <ArrayReference !DREF="LeaveType7>?> 

5 If an XML element has an ArrayRowlterator child, then that element (<tr> in this example) 
will be copied into the instance document one or more times. When the source document 
is processed by the Processing Engine 202, the processing engine 202 determines how 
many times that element is to be included in the instance document. To determine this, it 
uses a "repeat" attribute. If the value of the repeat attribute is a positive integer, then the 
10 element is included that many times. Otherwise, the value of the repeat attribute is the 
name of an interview item that can only take an integer response (preferably any limit on 
the range would be specified in that interview item). 



For each time it is copied in, a distinct yalue is required for each Interview Item referred to 
15 in an ArrayReference processing instruction. For example, if the value of the repeat 
attribute was 2, then the first time the tr element appeared (i.e., the .first. row), the 
UserTextQuestion "LeaveType" might have value "Annual Leave"; and the second time 
(i.e., the second row), "Maternity Leave". 



20 The Interviewltems package 408 contains a class which represents an ArrayRowlterator. 
The ArrayRowlterator class contains a collections object which contains 
UserTextQuestions and MultipleChoiceQuestions for each row. When the 
ArrayRowlterator is passed to the invoking process to be answered, the UserTextQuestions 
and MultipleChoiceQuestions contained within it get asked as necessary. 

25 

In the preferred embodiment, there is no corresponding element in any logic source for an 
ArrayRowlterator processing instruction. In other words, nothing is needed in the any of 
the logic sources (beyond ordinary Interview Items ie MultipleChoiceQuestions and 
UserTextQuestions) which are referenced by the ArrayReference processing instructions). 

30 
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It is often desirable to use an Interview Item, within an element which has an 
ArrayRowlterator child, that takes the same value each time the element is repeated (in 
other words, such an Interview Item does not get asked for each row). 

5 Consequently, the preferred embodiment uses the ArrayReference processing instruction 
seen above, rather than assuming that any interview item that occurs within an element 
with an ArrayRowlterator child is to be treated as an array of questions (ie asked again for 
each row). If that assumption were valid, then there would not be any need for an 
ArrayReference processing instruction. 

10 

An alternative embodiment which did not make this assumption but also did not use an 
ArrayReference processing instruction might include an attribute on each Interview Item 
which indicated whether in an array the interview item was asked just once, or once each 
array iteration. For example, that attribute might .be named ArrayRowIteratorBehavior fc 
15 with allowed values "variant" or "invariant", and appear on* the Question child of a 
UserTextQuestion or MultipleChoiceQuestion. 

A SmartModule processing instruction is used in a source document to indicate that a 

fragment of XML at a specified URI is to be retrieved when the source document is run 

20 through the processing engine 202. For example, the following is taken from the example 

source document: 

<?SpeedLegal 

<SmartModuIe uri=7home/jml/termination.sm"> 
<PartyMapping> 

25 <PartyMap From="Contractor" To="Employee n > 

<PartyDetaiIMap From= ,l Shortname" To="Firstname"/> 
<PartyDetai!Map From="Fullname" To="Firstname7> 
</PartyMap> 
</PartyMapping> 
30 </SmartModule> 
?> 

m i 

The PartyMapping is used to translate PartyReferences in the XML fragment referenced 
by the SmartModule PI to references to party details on parties which are known to the 
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source XML document. For example, consider a base or main XML document 812 that 
refers to parties PI, P2, and E, as shown in Figure 9. The XML document 812 references 
the first logic source 802 which contains references to parties PI and P2. The XML 
document 812 also references a second logic source 804 that contains references to parties 
5 A and B only. The base document 812 also references a third logic source 806 that 
contains references to parties C, D, and E only. However, the third logic source 806 itself 
refers to a fourth logic source 808, containing references to parties M and N. The third 
logic source 806 (which defines parties C, D and E) needs to use logic contained in the 
fourth logic source 808. However, the only party names "known" to the fourth logic 

10 source 808 are M and N. To allow the logic referring to parties M and N in the fourth logic 
source 808 to be used for parties C and D, the reference to the fourth logic source 808 in 
the third logic source 806 includes party mapping instructions to map any references to 
party M in logic imported from the fourth logic source 808 to refer instead to party C. . 
Similarly, party N is mapped to party D. By performing this party mapping; the third logic 

15 source 806 is able to use the logic contained in the fourth logic source 808 in statements 
referring to parties known only to the third logic source 806. 

Referencing and party mapping can be nested. In order for the main XML document 812 to 
make use of the logic contained in the second logic source 804 and the third logic source 

20 806, the main document 812 includes party mapping instructions to map parties A and B of 
the second logic source 804 to parties PI and P2, respectively. For the third logic source 
806, parties C and D are mapped to parties PI and P2, respectively; because there is no 
mapping for party E, it becomes "known" to the main XML document). However, because 
the third logic source 806 contains a reference to the fourth logic source 808, the logic 

25 imported from the fourth logic source 808 effectively undergoes two levels of party 
mapping, referred to as a party mapping chain. That is, references to parties M and N in 
logic from the fourth logic source 808 that were already mapped to C and D as described 
above are now mapped again, in this case to parties PI and P2, respectively, when 
imported into the main XML document 812. 



30 
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If a single logic source is used more than once (ie it is used in different logic sources, in 
which case, in the example in Figure 8, that LogicSource would be depicted more than 
once), then instances of a piece of logic contained within it which is used more than once 
will be regarded as distinct, if and only if the party mapping chains for the instances of the 
5 logic sources are distinct. 

As shown in Figure 10, party mapping can also be performed for XML source fragments 
imported into the main XML source document 812 via a SmartModule processing 
instruction. For example, a first XML fragment 1002 contains references to parties X and 
10 Y. In order to import this fragment 1002 into the main XML document 812, these parties 
are mapped to parties known to the main XML document 812, i.e., two of the parties PI, 
P2, and E. Similarly, a second fragment 1004 containing references to a party Z can be 
imported into the main XML document 812 by mapping the party Z to one of these three 
parties. « v.: 

15 

A Notes processing instruction is used to present J one or more notes to a user, via their 

browser display and/or in a printed document. For example: 

<?SpeedLegal 
<Notes> 

20 <Note Compietionlnstruction^'false" ShowExternalUsers^Talse" 

UserLevel="Non-Specialist"> 
<NoteTitle>Paid and Unpaid Leave</NoteTitle> 
<NoteBody><p>AII employees can request <b>unpaid</b> 
leave</pxp>On!y <b>Full-time</b> or <b>Part-time</b> 
25 employees who have accrued leave can request <b>paid</b> 

leave. </p></NoteBody> 
</Note> 
</Notes> 

?> 

30 

The Completionlnstruction attribute indicates whether the note provides information as to 
how the document should be completed; the ShowExternalUsers and UserLevel attributes 
allow the note to be shown or not shown, depending on whether the user matches the 
specified criteria. UserLevels can be defined arbitrarily. A user is defined to be external 
35 for the purposes of ShowExternalUsers if, in a role-based access control system, the user is 
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in an external role (ie external to the organisation running the document assembly system). 
Notes can also be attached to a question or a response of a UserTextQuestion or a' 
MultipleChoiceQuestion in a logic source. 

5 The Metalnformation processing instruction allows Metalnformation to be attached to an 
element in the source document. There is also a Metalnformation element defined in the 
grammar for a Logic Source. In this implementation, that definition uses concepts 
provided by RDF and Dublin Core. The format of the Metalnformation PI data is the same 
as the definition of the Metalnformation element in the Logic Source grammar. 

10 

A standard XML editor designed purely for editing XML documents only needs to concern 
itself with the document grammar, the user interface, and the XML document. In contrast, 
the editor 112 of the document assembly system performs standard editing functions but' 
also: 

1 5 (i) defines and manipulates logic in logic sources 2 1 0 according to the logic grammar 
2i4; ' 

(ii) refers to and manipulates references to logic sources 210 within XML source 
documents 208. This involves recognising the Processing Instructions described 
above and handling them as if their PI data were 'real' XML elements in the XML 

20 source document 208. It also includes ensuring that all of the logical interactions 

between a logic source 210 and the XML source Document 208 are 6 valid 5 . To do 
this, the editor 112 treats the source document 208 and the logic sources as if they 
were a single document, so that, for example, the IDREF in a Condition 
Processing Instruction to a Condition ID in a logic source can be validated; and 

25 (iii) presents a user interface to the user that allows an XML source document, logic 
source and logic references within an XML source document to be easily 
manipulated. 

As shown in Figure 12, the process of generating input files with the editor 112 of the 
30 document assembly system typically includes XML source document (without logic) 
creation and editing at step 1202, logic source content creation and editing at step 1204, 
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and then creation and editing of logic references within the XML source document at step 
1206. 

Figure 13 is a screenshot of a window display generated by the editor 1 12 during editing of 
5 a source XML document representing a letter. The window display includes a pull-down 
menu toolbar 1302, an icon toolbar 1304, and a document window 1306. The document 
window 1306 is vertically divided into a structure portion 1308 on the left and a detail 
portion 1310 on the right. The structure portion 1308 is itself vertically divided, with a 
hierarchical display 1312 of the XML elements of the document on the left, and an 

10 abbreviated description display 1314 of each element on the right. The detail display 1310 
includes text boxes 1316 to 1320 for displaying and entering XML elements and textual 
content. Pull down menu buttons 1324 to the left of each text box 1316 to 1320 allow the 
insertion of XML elements and processing instructions. To insert elements or processing - 
instructions within, a body of text the user Gan bring up a menu of allowable insertions by 

15 either using the mouse (right-clicking) or a key combination. This will insert the element - 
or processing instruction within the text at the current position of the cursor. The presence 
of a condition processing instruction associated with the content of a text box is indicated 
by the presence of a red dot on the corresponding pull down menu icon 1324. An orange 
dot on the icon 1324 denotes a Note. Thus in Figure 13 the first text box 1316, third text 

20 box 1318, and the fourth text box 1320 represent conditional text elements that may or 
may not be included in an instance document generated from this source document being 
edited depending upon the value of the condition associated with each text element. The 
second text box 1322 has a note that does not form part of the text of the instance 
document but is viewable in HTML and PDF renderings of the instant document, as a pop- 

25 up window, for example. 

Figure 14 is a screenshot of the editor 112 window display after definition of a multiple 
choice question. The variable associated with this question is named EmploymenMype, 
and three conditions, EmploymentJype.Casual, EmploymenMype.PartTime, and 
30 EmploymenMype. FullTime indicate whether an employee is employed on a casual, part- 
time, or full-time basis. A second multiple choice question, AmountOfLeave, is also 
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shown, with associated conditions AmountOf Leave. negotiate and AmountOfLeave.award. 
These conditions will be used to select the appropriate text for inclusion in an instance 
document depending upon the values of these variables, which are determined by 
responses received from a user. 
5 ' 

Figure 15 shows a screenshot of the editor 112 window display during definition of the 
question associated with the AmountOfLeave variable described above. An ID text box 
1502 allows the user to enter the ID name of the question, and a question text box 1504 is 
provided for the user to enter the actual text of the question that will be presented to the 

10 user. Note that, in this case, a party reference is included in the question text so that the 
employee's first name can be included in this text when it is displayed to an end-user. A 
pull down menu 1506 allows the user to specify whether the default answer to this question 
should be determined on the basis of a bias favouring a particular party, for example, 
employee or employer, or a predetermined default value. In this case, a bias has been 

15 selected, and consequently a bias button 1508 is displayed for each possible response to the 
question. A bias button 1508 is used to specify the degree to which the corresponding 
answer suits a particular party. If, during an interview round, an end-user specifies that the 
selected party is to be favoured, then the default answer to that question will be the one 
which best suits the selected party. This information, including bias and/or fixed default 

20 information, is included in the SelectionCriteria element associated with each response. 

A prompt text box 1510 is provided for editing each possible response (as it will be 
presented to the user) to this multiple choice question. Value text boxes 1512 allow the 
user to enter a variable value associated with each response. For example, if the award 

25 entitlements response was selected by a user, the MultipleChoiceQuestion or variable 
AmountOfLeave would have value f, award ,! , and the related conditions 
AmountOfLeave.award and AmountOfLeave. negotiate would have a values of true and 
false respectively. An "add notes" button 1514 is provided for each response, allowing the 
user to add note text which can is displayed during the corresponding interview to assist a 

30 user to select the appropriate response. 
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Within the editor 1 12, the user may also need to be able to reuse a chunk of XML with its 
associated logic. A reusable chunk of XML is referred to as a Module. 

A Module comprises: 



(ii) external-items (that is, items in the document but outside the text selected by the 
user that are referred to (generally by ID) by that text or by a reference from that 
reference and so on); and 

(iii) a LogicSources element declaring logic required by (i) and (ii). 
10 To make a Module available for re-use, the editor 1 12: 

(i) exports the element or text the user has selected; ' 

(ii) identifies and possibly exports external-items, such that the entire module is able 
to be made valid once imported into a target document; and . 

(iii) identifies all logic that the module references and ensures that that logic is in logic 
15 sources declared in the Module's LogicSources element.. : 

There are two ways that a user may desire the module to be used. In the first, a copy of the 
module is imported into the document - this allows the author to modify the text and/or 
logic as the module now forms part of the target document. For a module copy, the editor 
20 112: 

(i) imports the text - ensuring that it is valid in the target document. This may mean 
altering identifiers to allow the import to occur; 

(ii) imports or ensures that any external-items specified in the module are present in 
the target document. This may mean altering identifiers to allow the import to 

25 occur ; 

(iii) ensures the presence of (importing into a logic source used by the target document 
if necessary), all associated logic referred to by the text, external-items or from 
within the logic itself. This may mean altering identifiers to allow the import to 
occur ; and 

30 (iv) maps any Party references (if possible) in the text, logic and external-items to 



5 



(i) the XML element or text the user has selected; 



those used in the target document. 
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Alternatively, the user can import into the document a link to the module - this can point 
to a current version of the module, and any modifications subsequently made to the module 
are automatically reflected in the resulting document. For a module link, the editor 1 12: 

(i) creates a SmartModule processing instruction in the target document (which links 
5 the module to the target document) - ensuring that it would be valid if present in 

the target document. This can include providing a map of identifiers that need to 
change in order to allow the import to occur; 

(ii) displays the module to the user so that the text can be viewed (but not edited) as if 
it were part of the target document; 

10 (iii) ensures that any external-items required by the module are present in the target 
document (adding if necessary); 

(iv) adds logic sources required by the module to the target document LogicSources; 

(v) provides party mapping (if possible) so that the module can be imported into the 
target document; 

15 (vi) provides a means by which identifiers (declarations and references) in the module 
can be kept track of such that the target document cannot be invalidated when the 
module is imported into the target document by the Processing Engine 202. The 
editing application needs to track all identifiers used in the XML document 
including linked Modules preferably without having to import or fetch the Module 

20 every time. This requires: 

(a) keeping references to logic made from the linked Module 

(b) keeping references to links between Module elements and elements defined in 
the XML document 

(c) keeping references to declarations made within the linked Module 

25 

In some circumstances, it may be desirable to ensure that an instance XML document can 
be generated which conforms to the same grammar as the source document, for example, if 
the XML instance document is to be used outside the document assembly system (for 
example, edited in a standard XML editor). 

30 
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If it is possible to place conditions on any element in the source XML document, then there 
is no guarantee that the instance XML document will be valid. 

As an example, suppose the DTD (xhtml-letter-v2) used for the letter document ' above 
5 specified that a <letter> must have one and only one <LetterHead> child. Then, if a user 
were able to place a condition on the LetterHead, and in processing the source document 
through the Processing Module 202 that condition evaluated to false, the <LetterHead> 
child would be omitted from the instance XML document, with the result that the instance 
XML document would be invalid. 



XML grammars can express a variety of restrictions on the number of occurrences which 
are allowed for particular elements, for example zero or one, exactly once, and one or more 
times. If an instance document does not match a specified rule, it wilfcbe invalid. 

15 To ensure that the instance XML document is valid, a condition PI can only be attached to 
an element if: 

(i) the element is allowed zero or more times; or 

(ii) the element is allowed to occur zero or once; or 

(iii) the element is allowed to occur one or more times, and either: (a) one of the 
20 occurrences of the element does not have a condition attached, or (b) one of the 

conditions is always true when the others are false. 

In order to express more than one alternative for an element that is only allowed to occur 
zero or one times, one condition is applied to the element itself to determine whether the 
25 element appears at all in the instance XML document, and alternative condition Pis are 
applied to nodes inside the element (the same rules apply to the use of conditions on these 
nodes). If an element is to occur exactly once, and it is desired to allow several 
possibilities for its textual content, then conditional phrases are used inside the element - 
the editor 112 prevents a condition from being attached to the element itself. 



10 



30 
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Similarly, if an XML document contains an element with an ID, and another with an 
IDREF which points to it, then, if the element with the ID had a condition on it which 
evaluated to false, with the result that that element was not included in the instance XML 
document, and the element with the IDREF was included, the document would be invalid. 

5 

Accordingly, if the instance XML document is to be valid, then any element with an 
attribute of type IDREF must be subject to conditions which include all the conditions to 
which the element with the corresponding ID is subject. Whenever the conditions to which 
an element with an ID is subject are altered, the editor 1 12 should ensure all elements with 
1 0 an IDREF pointing to it are subject to appropriate conditions. 

After a source document 208 and its one or more logic sources 210 have been defined with 
the editor 112, they can be used by the processing engine 202 to generate one or more 
instance documents. The processing engine 202 begins processing an input XML 
15 document 208 when the Java controller servlet 21 8 creates an instance of the evaluatqr 402 
in response to a request from the user's web browser. In the preferred '^nibodiment, the 
constructor for the evaluator takes the input XML document as a document object model 
(DOM) object, but it could alternatively be passed as text, or as a URI from which it might 
be retrieved. 

20 

Later, the Evaluator 402 in conjunction with the evaluable node package 404 and the 
parties package 406 will identify questions that need to be answered. In order for it to be 
able to suggest a default answer or answers for each question which would result in text 
intended to favour one party to the document over another (or to automatically answer a 
25 question that is not a KeyQuestion), the Java servlet 218 may specify to the Evaluator 402 
which party to favour, as ascertained at step 504. 

The Evaluator 402 constructs two objects it will need later: operativeConditions, and 
operativelnterviewltems. It also extracts the logic from each logic source referred to in the 
30 document's LogicSources element, and from each logic source referred to in the 
LogicSources elements in each logic source, and so on (being careful not to get into an 



P:\OPER\DBW\specdksal 



i^jH^mblyprm.doc-l Mnrdi. 2tNi2 ^^^^^ 



-55- 

infinite loop). However, in the preferred embodiment, the Evaluator 402 does not extract 
the logic from a logic source imported by an XML fragment until the SmartModule 
processing instruction is encountered later. When the logic import is performed, the 
Evaluator 402 determines how the parties in that logic source are mapped, relative to the 
5 main XML document (the party mapping chain). Each Condition and Interviewltem (ie 
MultipleChoiceQuestion and UserTextQuestion - there are no ArrayRowIterators in a logic 
source) is stored in a hash table keyed by the URI of its logic file combined with its party 
mapping chain and its own ID, so that discrete instances (as explained above) can be 
stored, and they can be retrieved efficiently. 

10 

Finally, the Evaluator 402 extracts Party declarations from these LogicSources, creates a 
Party object to represent the Party declaration, and stores each Party object in the Parties 
object of the party package 406, keyed by the URI of its logic file combined with the ID of - 
the Party. Each Party object has "objects associated with it representing each Party Detail, 
15 and each PartyDetail has a means of representing each Value in it. The Value "element is 
stored in an object named PartyDetailValue. 

The invoking process (the Java Controller Servlet 218) then calls 
evaluator . initialisePartiesInUse () . The purpose of this method is firstly, 

20 to identify any UserTextQuestions in a Party XML object that will definitely need to be 
answered (Le. 9 irrespective of whether any conditions in the document evaluate to true or 
false), and secondly, to identify the Party XML objects for which the engine will need to 
know a gender and/or number. It is desirable to establish this up front, so that for an 
invoking process which interacts with an end-user, the questions relating to the parties can 

25 be put to the user before the other questions. One of the benefits of doing this is that words 
in those other questions that identify a party by using a PartyReference can be replaced 
with the value of that PartyReference. 

The method evaluator . initialisePartiesInUse () performs a breadth-first 
30 traversal of the DOM object. The breadth-first traversal is implemented recursively; 
recursion stops at any node that has a condition attached to it. Where in the course of 
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performing the breadth-first traversal, a PartyReference PI is encountered, getValue is 
invoked on the appropriate PartyDetail. That method is as follows: 



public String getValue ( Evaluator evaluator ) 
throws SpeedLegalLogicException { 

if (value ! =null) { 
return value; 

} 

boolean returnEarly = false; 

String number = party . getNumber () ; 
String gender = party . getGender () ;. 

if ( gender==null ) { 

if ( needToKnowGender ( ) ) { 



// Ask the user the gender question 
20 party . as kGender (evaluator ) 

returnEarly = true; 
} else { 

// If we don't know itaiid'don't need to know it, 
25 // because the only value provided is for 

"unspecified" 

gender = "Unspecified"; 

} 



} 



if ( number==null ) { 

if ( needToKnowNumber ( ) ) { 



// Ask the user the number question 
35 party . askNumber (evaluator) ; 

returnEarly = true; 
} else { 

number « "Unspecified"; 

40 } 
} 

if (returnEarly) { 
return null; 
45 } 



// If we get here, we're guaranteed some value for both 
gender and number 
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// Is there an exact match on gender and number? 
String key = PartyDetailValue . constructKey ( gender, 

number) ; 

5 if ( values. get ( key .) !=null ) { 

value = ( (PartyDetailValue) values. get ( key 
) ) .getValue ( evaluator ) ; 

return value; 

} 

10 

// Failing that, fall back to partial matches in some 
circumstances . 



15 // Match on Gender first, but could do Number first 

if we wanted. . 

// GENDER 

// if we wanted to prohibit match on Unspecified number 
20 whenever there 

// was a value for either but not both Singular and 
plural, we'd .. » 

// have if ! needToKnowNumber ( ) here... 

25^ key = PartyDetailValue . constructKey ( gender, 

"Unspecified"); 

if ( values. get ( key )!=null ) { 

value = ( (PartyDetailValue) values .get ( key 
) ) .getValue ( evaluator ) ; 
30 return value; 

} 

// NUMBER 

//if ( IneedToKnowGender () ) { - see comments on Gender 

35 above 

key = PartyDetailValue. constructKey ( "Unspecified", 

number) ; 

if ( values. get ( key ) !=null ) { 

log. debug ("Partial match on number."); 
40 value = ( (PartyDetailValue) values . get ( key 

) ) .getValue ( evaluator ) ; 

return value; 

} 

45 

// What about a match on completely unspecified? 
//if ( ! needToKnowNumber () && ! needToKnowGender ( ) ) { 
log. debug ("Trying to match unspecified number and 

gender. ") ; 

50 key - PartyDetailValue . constructKey ( "Unspecified", 

"Unspecified") ; 

if ( values. get ( key ) !=null ) { 



P^PER\DB\V\spccdk8af^^^PiWyprtn-.doc-l March. Hit\2 



-58- 



log. debug ("Match on default only.."); 
value = ( (PartyDetailValue) values .get ( key 
) ) .getValue ( evaluator ) ; 

return value; 

5 } 



// We could get here if the document author didn't provide 
Unspecified number 
10 // and Unspecified gender, but only, say Female Singular, 

// and the end user. chose say Male Singular, 

log. error ("No match, not even on Unspecified."); 

value = "[* No match for party: " + party . getKey ( ) + " , 
detail: " + id 

15 + ", of gender f " + gender + " ' and number f " 

+ number + " f . Document author must fix this. *]" ; 
return value; 



20 Where necessary, by invoking other methods, getValue causes Interviewltems to be 
created which represent the questions that need to be answered. The questions are either 
MultipleChoiceQuestions that seek to establish the Gender (Male, Female or Neuter) or 
Number (Singular or Plural) of a Party, or UserTextQuestions that prompt for text forming 
part of the value of the PartyDetail for the matching Gender and Number (where 

25 'matching 1 is determined by the procedure implemented by the getValue method). 



When the Evaluator 402 has finished doing this, the invoking process checks to see 
whether there are any Interviewltems that need to be resolved. If there are no 
Interviewltems that need to be resolved, Evaluator.evaluateO is invoked. It extracts any 
30 Interviewltems from the document which need to be resolved at this point. Its function is 
described in detail below. 



If there are still no interview items that need to be resolved, then the input document is a 
trivial case, and the instance XML document would be identical. Otherwise, the invoking 
35 process resolves the interview items. 



In the embodiment described here, the Java controller servlet 218 creates an HTML form 
for a user to complete in their web browser. However, it will be apparent that the 
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controller servlet could seek responses from some other source, for example, a database 
which it queries using SQL, a web service, or some application using its API. In such an 
embodiment, a means would be provided for a logic source to specify a preferred method 
for resolving either all interview items, or some particular interview item, together with the 
5 details required in order to do so (eg URL, authentication information, query etc) which 
the invoking process would then attempt to honour. It will be apparent that the invoking 
process would need tb be sure to return a response for each interview item, even if it 
couldn't get a response using the preferred method. 

10 Having . resolved the interview items, the invoking process calls Evaluator 402 f s 
evaluate () method, passing it a set of interview items that contain values, 
evaluate () first transfers the answers from the interview items proffered by the 
invoking process to any matching interview items for which it was awaiting responses. 
Because there is a flag on each Party Detail and each Condition object which ensures that 

15 the Evaluator 402 only attempts to evaluate it once each time the invoking process calls 
Evaluator 4 02 1 s evaluate ( ) method (hereinafter called "interview round"), 
these flags are reset. 

As described above, the Evaluator 402 performs a recursive breadth-first traversal of the 
20 DOM object. During the course of the breadth-first traversal, certain nodes will be 
encountered that require special handling. In this embodiment, these nodes are all 
processing instructions so that they can be included in an XML document without affecting 
its validity. 

25 PartyReference processing instructions are treated as described above. It is possible for 
the Evaluator 402 to encounter a PartyReference to a PartyDetail not previously 
encountered (because it occurs below an element to which a condition is attached). As 
before, getValue is invoked on the appropriate PartyDetail. 

30 It will be apparent that in embodiments of the present invention, any node could be given 
special handling - either by node name, or node type, in either all documents or only 
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documents using a specified DTD, or indeed by position in the document. In the preferred 
embodiment, the traversal method calls an empty method which can be overridden in a 
subclass of Evaluator to implement this special handling if desired. 

5 When a node that has a condition processing instruction is encountered, the Evaluator 402 
attempts to evaluate it. First, it looks to see whether that condition has been encountered 
already (if it has, an object representing it will be stored in the operativeConditions object). 
If the condition has not been encountered already, an object that represents it will be 
constructed and stored in the operativeConditions object. Then, if the condition is not fully 
10 evaluated, and if the relevant flag indicates that no attempt has been made to evaluate the 
condition in this interview round, the Evaluator 402 attempts to evaluate the Condition. 

The Condition is evaluated as follows. Each of the elements that can appear in a Condition 
- the operators And, Or, Not, Test, UseCbhdition, and the values True and Falss - 

15 implements an interface that defines a method named evaluateO (not- to be confused with - 
Evaluator 402 or its evaluateO method); and has a constructor with the same signature. 
Because of this, in this embodiment, the Condition object can use a Java programming 
technique known as reflection to construct objects representing its children, and call their 
evaluateO method. In turn, if a child of an object is And, Or, Not, Test, or UseCondition, 

20 the objects can recursively construct and call evaluateO on 311 object representing that 
child, so that ultimately, each of And, Or, and Not, has arguments that are boolean values 
that can be evaluated to true or false using classical boolean logic. Finally, Condition(True) 
evaluates to True, and Condition(False) evaluates to False. 

25 If a Test node is encountered, then when its evaluateO method is invoked, the Test object 
checks to see whether the MultipleChoiceQuestion it refers to has been instantiated as an 
object (it will have been if some other Test node has already been looked at which refers to 
the same MultipleChoiceQuestion). If the MultipleChoiceQuestion has not been 
instantiated, a MultipleChoiceQuestion object representing it is constructed. 

30 
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If the invoking process has indicated that only Key Questions should be asked, and this 
question is not a Key question, then the processing engine 202 can evaluate it 
automatically using the scoring procedure based around the SelectionRules and 
SelectionCriteria for that MultipleChoiceQuestion (in this implementation, a method in the 
5 object representing the MultipleChoiceQuestion in the interview item package 408 does 
this). Where a new MultipleChoiceQuestion has been constructed, it will be added to 
operativelnterviewltems. Each Interviewltem (a MultipleChoiceQuestion is a type of 
Interviewltem) in operativelnterviewltems has status either "pending" or "known". 

10 If a MultipleChoiceQuestion has status pending, then the Evaluator 402 waits for the 
invoking process to provide it with a response value before it evaluates the Test that uses it 
to true or false. Interviewltems with status pending are not returned to the invoking 
process as they are created, but rather, are only returned when the traversal has been 
completed. This makes it easy for aii invoking process that interacts with a user "to put 

15 several questions to the user" at once, perhaps organised by the value of the 
. MultipteChoiceQuestion's Topic child. 

If the MultipleChoiceQuestion has status known, then Test's Evaluate method will return 
true if the MultipleChoiceQuestion has a response equal to the value specified in the Test, 
20 and false otherwise. 

If the condition evaluates to true, then the traversal can descend into the node to which the 
condition applies. If it evaluates to false, that node will not be included in the instance 
XML document (To assist the rendering engine 204 later on, the evaluate method of the 

25 Evaluator 402 adds an attribute named "diedlnEvaluation" to an element that had a 
■ condition which evaluated to false.). If the condition cannot be evaluated to either true or 
false, the traversal does not in this interview round descend into the node to which the 
condition applies. Instead, any pending interview items (created as described above) will 
in due course be returned to the invoking process for resolution, so that, assuming 

30 responses are provided, the condition may be evaluated in the next interview round. 
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When an InsertReuseablePhrase processing instruction is encountered, the Evaluator 402 
retrieves the ReusablePhrase node it refers to from the relevant logic source, and 
physically replaces the processing instruction node in the DOM object with that 
ReusablePhrase node. It then puts an IDREF to that logic source on relevant descendants 
5 of the ReusablePhrase node, and converts any logic elements in those descendants from 
the XML format used in the logic file to the Processing Instruction format appropriate to 
the XML source document. It is then possible to traverse the descendants of the 
ReuseablePhrase node in the usual way, albeit in a context provided by the party mapping 
chain for the logic source it came from (so Conditions and Interviewltems can be looked 
10 up correctly). 

A consequence of this way of processing an InsertReuseablePhrase processing instruction 
is that a single ReusablePhrase node is inserted into the document and traversed multiple 
times if there is more than one InsertReusablePhrase processing instruction that points to 
15 it. In an alternative embodiment, each ReusablePhrase node could be traversed just once, 
and the result inserted into the document at a later step. 

When an InsertUserText processing instruction is encountered, the Evaluator 402 
determines whether a UserTextQuestion object representing the UserTextQuestion the 
20 processing instruction points to already exists in operativelnterviewltems. If it does not, 
the Evaluator 402 creates a new UserTextQuestion object, and adds it to 
operativelnterviewltems with status "pending". 

When an ArrayRowlterator element is encountered, the processing engine 202 first checks 
25 to see whether its "repeat" attribute has a known positive integer value. If it does not, then 
the ArrayRowlterator cannot be processed any further until the Interviewltem named in that 
attribute has a value (it is an error if that value is not a positive integer). Once the "repeat" 
attribute has a known positive integer value, a two-dimensional array of Interviewltems is 
created. The first dimension is the Interviewltems 'identified in the ArrayReference 
30 processing instruction descendants of the parent of the ArrayRowlterator; the second 
dimension is the number of rows specified by the "repeat" attribute. The 
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ArrayRowIterator object containing that two-dimensional array of interview items is added 
to operative interview items, and resolved in the usual way. 

A situation can arise whereby two ArrayRowlterator processing instructions contain 
5 ArrayReference processing instructions pointing to the same interview item. For example, 
suppose a first ArrayRowlterator has a repeat attribute value of nl, and the second n2. In 
the preferred embodiment, the Interview Items in the abs(n2-nl) rows common to both 
ArrayRowlterators are then asked only once. However, alternative embodiments could 
require independent values for the Interviewltems in those rows. 

10 

When the breadth-first traversal is complete, the evaluate method of the Evaluator 402 
returns control to the invoking process. The invoking process then asks the Evaluator 402 
for all operativelnterviewltems that have status pending. If there are none, then the 
document has been fully evaluated, and ah instance document is created by the Rendering 1 
15 Engine 204. If there are operativelnterviewltems with status pending^ the- invoking process 
determines a response value for each of those interview items, as described above 
following evaluator. initialisePartiesInUse ( ) . 

Having resolved the interview items, the invoking process callsevaluate method of 
20 Evaluator 402, passing it a set of interview items which contain values. This cycle 
repeats, until such time as there are no pending interview items. 

When there are no pending interview items, an instance document in XML, PDF, RTF, 
HTML, or some other format, can be created from the DOM document object which the 
25 processing engine 202 has been manipulating. 

As the rendering engine 204 creates an instance document, elements of the source XML 
document with conditions that evaluated to false (and therefore had an attribute named 
"diedlnEvaluation") are omitted, each InsertUserText processing instructions is replaced 
30 by the value of the corresponding UserTextQuestion, the ReusablePhrases have replaced 
the InsertReusablePhrase processing instructions, and so on. This is done using XSLT 
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with Xalan extension functions, which get the values of UserTextQuestions and 
PartyReferences from the evaluator. In alternative embodiments, an extension function 
could look up the value of the condition, instead of relying on the presence or absence of 
the diedlnEvaluation attribute. An alternative to using XSLT for most of the rendering 
5 engine's work would be to programmatically do another recursive traversal. 

Of particular interest is the transform to HTML. The instance document can be rendered 
in such a way that any text that appears as a result of one or more conditions evaluating to 
true, or as a result of a response to a UserTextQuestion, is generated by the rendering 

10 engine 204 as a hypertext anchor. When that anchor is activated, the user interface 
f displays the question and response that resulted in the anchor text being included, as 
described above. In the case of text that is subject to one or more conditions, it would be 
possible to display the questions associated with each, condition. However, in the preferred 
embodiment, only the questions associated with the closest ancestor having a condition 

15 processing instruction are displayed. 

When the anchor is activated, it is also possible to reset one or more of the questions 
associated with the condition (which means the condition should be re-evaluated, so that 
the question is asked again), or, in the case of a UserTextQuestion, to ask it again. Where 
20 a question associated with a condition is reset, any other condition which used that 
MultipleChoiceQuestion is also reset, as is any other condition which used that condition, 
and so on. 

In the user interface available to a user of the document assembly system via their web- 
25 browser, when a user moves their mouse over the relevant text, JavaScript and the anchor 
element's HTML onMouseOver event is used to generate a pop-up window displaying the 
relevant questions and responses; when a user clicks on the anchor, the Processing Engine 
202 is invoked to reset the relevant questions. 
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The HTML rendering of the document also typically provides a list of all the questions that 
were answered, together with the user's responses. If the user selects a question link, the 
Processing Engine 202 resets that question as described above. 

5 Many modifications will be apparent to those skilled in the art without departing from the 
scope of the present invention as herein described with reference to the accompanying 
drawings. For example, by defining additional processing instructions, the preferred 
embodiment can easily be extended to add other features to the document assembly 
system. 



The reference to any prior art in this specification is not, and should not be taken as, an 
acknowledgment or any form of suggestion that that prior art forms part of the common 
general knowledge in Australia. 
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DATED this 1 st day of March 2002 



SPEEDLEGAL HOLDINGS INC 
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By its Patent Attorneys 
DAVIES COLLISON CAVE 
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. ■/*.. Letter of employment 

Work Hours 

FOR THE QUESTION 

* Does SpeedLefl&J permit flexible work hours in relation to Jason? 
YOU SAID 

* Y« 

604 

Aiwhat are Speed Legal 's normal office hours? 

is-— 606 

What are the core hours when Jason is expected to attend the office? 

. 606 

\70at7i - 12pm and 2pm - ^pav J 
Restraint 

FOR THE QUESTION - . '. , 

< Win Jason $e tea trained it om working for competitors of SpeedLegd eJTei lh* end of Jason's employment ? 

yousaip .. ' . ' ~ r ~ ,: 

* Yes \'i . . , • >'■ ? ' ■ ]' ' : \ ■"*■; 
V/hat is the duration of ttte restraint imposed on Jason? © 



| 6 months j\ 

Please define the geographic area and market of the restraint 



Australian, US, UK or global martet for tega! technology 



©NEXT 

^608 
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708 " Produce ani 4dobs Acrobat/POf document, containing: 



The legal document itself 
V — Editorial nates pertaining to the use of the document 

P" The interview questions on which the document Is based 

The document is write-protected 
710 ^Create I 70 6 

7 1 r-gave t/oor answers to the SmartRepository 
' AZ> . Start over with another SmartPrecedent 
. start over with another Answer Ffle 

14 February 2002 ,702 



Jason Harrop 
Unit 1,20-24 Bosi: 
Richmond 3121 



Dear Jason 



What is the date of 
this letter? ' 



. 14 February 2002 



704 



Employment with SpeedLegal 

On behalf of SpeedLegal. I am pleased to confirm the terms of your employment with us. 

1. Position and Duties „ 

; A. 

1 .1 You will be employed on a' full time basis, commencing on 15 February 2002 . 

1 .2 You will be appointed to the position of Programmer , reporting to Jamie Wodetzki . 

1 .3 Your duties and responsibifities will include product design and development and such other 
related services as notified from time to time. 
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XML Document 

Manipulate document 
components 

e.g. adding some 
paragraph text "I refer to 
your recent letter" 



1202 



r 
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Define Logic 

Add and manipulate 
logic within a logic 
source 

e.g. adding some 
reusable text — a 
ReusablePhrase 



1206 



Add Logic to 
Document 

Add and manipulate 
logic within document 
source 

e.g. inserting a 
Condition and the 
ReusablePhrase into the 
XML Document 



<LetterHeacL> 

<Date>30 January 2002</Date> 
<AddresseeDetails ID=" JoeCitizen"> 
•CTitle>Mr</Title> 
< Fi r s t name > Joe </ Fi r s t name * 
<Surname>Citizen</Surname> 
< Postal Address> 

<AddressLine>GPO Box 1866R</AddressLine> 
<County>Melbourne</County> 
<State>Vic</State> 
<Postalcode>3000</P6 / stalcode> 
<Country>Australia</Country> 
</PostalAddress> 
</AddresseeDetails> 
<Salutation>Dear Joe</Salutation> 
</LetterHead> 
<.LetterBody*> 

<p>I refer to your recent letter. </p> 
<p>As the items you have ordered have now 
manufactured, we will ship them to you *over night . </p> 
«:/LetterBody> 



been 




<MultipleChoiceQuestion ID=»"response M > 
<QuestionType KeyQuestion~"true"/ v 

<Question>Is this correspondence in response to a 
letter you have received?</Question> 
<Topic>Setup</Topic> 

<SelectionRules AnswerUsing="Bias" 
Cardinality="Single" Device«"Checkbox"/*> 
<Responses> 
<Response> 

^SerectionCriteria Default^'Unchecked"/* 
_ . . _ — -:Prompti-Yes</Prompt> 

<SetValueTo>yes</SetValueTo > 
*./Response> 
<Response> 

<SelectionCriteria Def ault=»"Unchecked"/> 
<Prompt>No</Prompt> 
<SetValueTo >no< /SetValueTo > 
<■ /Response^ 
^/Responses"-* 
</MultipleChoiceQuestion> 

^-Condition ID- "response .yes" > 

«:Test IDREF=" response" Value="yes"/> 
*VCondition> 

<Condition ID="response.no"> 

<.Test IDREF-" response" Value-"no" /*> 
< /Condi tion> 
<Reus able Phrase ID= " itemDe scrip t ion" > 

<Condi tionalPhr as e 
Condi tion='» quantity . pallet ts " >bulk< /Condi tionalPhr as e> 
<InsertUserText IDREF="whatItem n /> 
</ReusablePhrase> 




'./LetterHead> 
<LetterBody> 
<p> 

<?SpeedLegal <Condition 
uri«"/f iles/logic/pa tent-sample . Igc" />?>! 
letter. </p> 

<p>As the <?SpeedLegal 

IDREF=="itemDescription" uxi= 
s ample. Igc 11 />?> you have ordered have now 
will ship them to you overnight . </p> 
'VLetterBody* 



IDREF= " response . yes 11 
refer to your recent 

<InsertReusablePhrase 
^"/files/logic/patent- 
been manufactured, we 



Figure 12 
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